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Preface 


Functional analysis has become a sufficiently large area of mathematics that 
it 1S possible to find two research mathematicians, both of whom call 
themselves functional analysts, who have great difficulty understanding the 
work of the other. The common thread is the existence of a linear space with 
a topology or two (or more). Here the paths diverge in the choice of how 
that topology is defined and in whether to study the geometry of the linear 
space, or the linear operators on the space, or both. 

In this book I have tried to follow the common thread rather than any 
special topic. I have included some topics that a few years ago might have 
been thought of as specialized but which impress me as interesting and 
basic. Near the end of this work I gave into my natural temptation and 
included some operator theory that, though basic for operator theory, might 
be considered specialized by some functional analysts. 

The word “course” in the title of this book has two meanings. The first is 
obvious. This book was meant as a text for a graduate course in functional 
analysis. The second meaning is that the book attempts to take an excursion 
through many of the territories that comprise functional analysis. For this 
purpose, a choice of several tours is offered the reader—whether he is a 
tourist or a student looking for a place of residence. The sections marked 
with an asterisk are not (strictly speaking) necessary for the rest of the book, 
but will offer the reader an opportunity to get more deeply involved in the 
subject at hand, or to see some applications to other parts of mathematics, 
or, perhaps, just to see some local color. Unlike many tours, it is possible to 
retrace your steps and cover a starred section after the chapter has been left. 

There are some parts of functional analysis that are not on the tour. Most 
authors have to make choices due to time and space limitations, to say 
nothing of the financial resources of our graduate students. Two areas that 
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are only briefly touched here, but which constitute entire areas by them- 
selves, are topological vector spaces and ordered linear spaces. Both are 
beautiful theories and both have books which do them justice. 

The prerequisites for this book are a thoroughly good course in measure 
and integration— together with some knowledge of point set topology. The 
appendices contain some of this material, including a discussion of nets in 
Appendix A. In addition, the reader should at least be taking a course in 
analytic function theory at the same time that he is reading this book. From 
the beginning, analytic functions are used to furnish some examples, but it 
is only in the last half of this text that analytic functions are used in the 
proofs of the results. 

It has been traditional that a mathematics book begin with the most 
general set of axioms and develop the theory, with additional axioms added 
as the exposition progresses. To a large extent I have abandoned tradition. 
Thus the first two chapters are on Hilbert space, the third is on Banach 
spaces, and the fourth is on locally convex spaces. To be sure, this causes 
some repetition (though not as much as I first thought it would) and the 
phrase “the proof is just like the proof of ...” appears several times. But I 
firmly believe that this order of things develops a better intuition in the 
student. Historically, mathematics has gone from the particular to the 
general—not the reverse. There are many reasons for this, but certainly one 
reason is that the human mind resists abstraction unless it first sees the need 
to abstract. 

I have tried to include as many examples as possible, even if this means 
introducing without explanation some other branches of mathematics (like 
analytic functions, Fourier series, or topological groups). There are, at the 
end of every section, several exercises of varying degrees of difficulty with 
different purposes in mind. Some exercises just remind the reader that he is 
to supply a proof of a result in the text; others are routine, and seek to fix 
some of the ideas in the reader’s mind; yet others develop more examples; 
and some extend the theory. Examples emphasize my idea about the nature 
of mathematics and exercises stress my belief that doing mathematics 1s the 
way to learn mathematics. 

Chapter I discusses the geometry of Hilbert spaces and Chapter II begins 
the theory of operators on a Hilbert space. In Sections 5—8 of Chapter II, 
the complete spectral theory of normal compact operators, together with a 
discussion of multiplicity, is worked out. This material 1s presented again in 
Chapter IX, when the Spectral Theorem for bounded normal operators is 
proved. The reason for this repetition is twofold. First, I wanted to design 
the book to be usable as a text for a one-semester course. Second, if the 
reader understands the Spectral Theorem for compact operators, there will 
be less difficulty in understanding the general case and, perhaps, this will 
lead to a greater appreciation of the complete theorem. 

Chapter III is on Banach spaces. It has become standard to do some of 
this material in courses on Real Variables. In particular, the three basic 
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principles, the Hahn-Banach Theorem, the Open Mapping Theorem, and 
the Principle of Uniform Boundedness, are proved. For this reason I 
contemplated not proving these results here, but in the end decided that 
they should be proved. I did bring myself to relegate to the appendices the 
proofs of the representation of the dual of L? (Appendix B) and the dual of 
Co( X) (Appendix C). 

Chapter IV hits the bare essentials of the theory of locally convex spaces 
—enough to rationally discuss weak topologies. It is shown in Section 5 that 
the distributions are the dual of a locally convex space. 

Chapter V treats the weak and weak-star topologies. This is one of my 
favorite topics because of the numerous uses these ideas have. 

Chapter VI looks at bounded linear operators on a Banach space. 
Chapter VII introduces the reader to Banach algebras and spectral theory 
and applies this to the study of operators on a Banach space. It is in 
Chapter VII that the reader needs to know the elements of analytic function 
theory, including Liouville’s Theorem and Runge’s Theorem. (The latter is 
proved using the Hahn—Banach Theorem in Section III.8.) 

When in Chapter VIII the notion of a C*-algebra is explored, the 
emphasis of the book becomes the theory of operators on a Hilbert space. 

Chapter IX presents the Spectral Theorem and its ramifications. This is 
done in the framework of a C*-algebra. Classically, the Spectral Theorem 
has been thought of as a theorem about a single normal operator. This it is, 
but it is more. This theorem really tells us about the functional calculus for 
a normal operator and, hence, about the weakly closed C*-algebra gener- 
ated by the normal operator. In Section IX.8 this approach culminates in 
the complete description of the functional calculus for a normal operator. In 
Section [X.10 the multiplicity theory (a complete set of unitary invariants) 
for normal operators is worked out. This topic is too often ignored in books 
on operator theory. The ultimate goal of any branch of mathematics is to 
classify and characterize, and multiplicity theory achieves this goal for 
normal operators. 

In Chapter X unbounded operators on Hilbert space are examined. The 
distinction between symmetric and self-adjoint operators is carefully delin- 
eated and the Spectral Theorem for unbounded normal operators is ob- 
tained as a consequence of the bounded case. Stone’s Theorem on one 
parameter unitary groups is proved and the role of the Fourier transform in 
relating differentiation and multiplication is exhibited. 

Chapter XI, which does not depend on Chapter X, proves the basic 
properties of the Fredholm index. Though it is possible to do this in the 
context of unbounded operators between two Banach spaces, this material is 
presented for bounded operators on a Hilbert space. 

There are a few notational oddities. The empty set is denoted by 0. A 
reference number such as (8.10) means item number 10 in Section 8 of the 
present chapter. The reference (IX.8.10) is to (8.10) in Chapter IX. The 
reference (A.1.1) is to the first item in the first section of Appendix A. 
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There are many people who deserve my gratitude in connection with 
writing this book. In three separate years I gave a course based on an 
evolving set of notes that eventually became transfigured into this book. The 
students in those courses were a big help. My colleague Grahame Bennett 
gave me several pointers in Banach spaces. My ex-student Marc Raphael 
read final versions of the manuscript, pointing out mistakes and making 
suggestions for improvement. Two current students, Alp Eden and Paul 
McGuire, read the galley proofs and were extremely helpful. Elena Fraboschi 
typed the final manuscript. 
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CHAPTER I 
Hilbert Spaces 


A Hilbert space is the abstraction of the finite-dimensional Euclidean spaces 
of geometry. Its properties are very regular and contain few surprises, 
though the presence of an infinity of dimensions guarantees a certain 
amount of surprise. Historically, it was the properties of Hilbert spaces that 
guided mathematicians when they began to generalize. Some of the proper- 
ties and results seen in this chapter and the next will be encountered in more 
general settings later in this book, or we shall see results that come close to 
these but fail to achieve the full power possible in the setting of Hilbert 
space. 


§1. Elementary Properties and Examples 


Throughout this book F will denote either the real field, R, or the complex 
field, C. 


1.1. Definition. If 2 is a vector space over F, a semi-inner product on & is 
a function u: 2X #— F such that for all a, 8 in F and x, y,z in &, the 
following are satisfied: 


(a) u(ax + By, z) = au(x,z) + BuCy, z), 
(b) u(x,ay + Bz) = au(x, y) + Bu(x, Zz), 
(c) u(x, x)= 0, 

(d) u(x, y) = u(y, x). 


Here, for a in F, a = aif F = R and a is the complex conjugate of a if 
F =C. If a€C, the statement that a > 0 means that a € R and a is 
non-negative. 
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Note that if a = 0, then property (a) implies that u(0, y) = u(a- 0, y) = 
au(O, y) = 0 for all y in 2. This and similar reasoning shows that for a 
semi-inner product u, 


(e) u(x,0) = u(0, y)=0 forall x,y in &. 


In particular, u(0, 0) = 0. 
An inner product on & is a semi-inner product that also satisfies the 
following: 


(f) If u(x, x) = 0, then x = 0. 
An inner product in this book will be denoted by 
(x,y) = u(x, y). 


There is no universally accepted notation for an inner product and the 
reader will often see (x, y) and (x|y) used in the literature. 


1.2. Example. Let % be the collection of all sequences {a,: n > 1} of 
scalars a, from F such that a, = 0 for all but a finite number of values of 
n. If addition and scalar multiplication are defined on 2 by 


{On} +{ Bi} = (on + By} 
a{ a, } = { aa, }, 
then Z is a vector space over F. 


If u({a,},{B,}) = Lx 167,>,, then u is a semi-inner product that is 
not an inner product. On the other hand, 


({a,}.{B,}) = LD a8, 


({a,},(By}) = DE —0,B,. 


n=1 
(LO, }> {Bn }> — dL WanBr 


all define inner products on 2%. 


1.3. Example. Let (X, 2,4) be a measure space consisting of a set X, a 
o-algebra 2 of subsets of X, and a countably additive R U {co} valued 
measure p defined on Q. If f and g © L?(p) = L’(X, Q, ), then Hélder’s 
inequality implies fg € L'(). If 


(f.8) = [fedn, 


then this defines an inner product on L?(11). 

Note that Hélder’s inequality also states that |{fgdp| < [f\f]* du]'”* - 
[flg|* du]’”?. This is, in fact, a consequence of the following result on 
semi-inner products. 
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1.4. The Cauchy—Bunyakowsky—Schwarz Inequality. Jf (- -) is a semi- 
inner product on 2, then 


(xs yD S (x, xy, Y) 
for all x and y in &. 
PROOF. If a € F and x and y € &, then 
O< (x -—ay,x-—ay) 
= (x, x) — af y,x) — a(x, y) + aly, y). 


Suppose (y,x) = be”, b>0, and let a=e t, t in R. The above 
inequality becomes 


0 < (x,x) —e “the — ethe~ + ty, y) 
= (x,x) — 2bt+ ty, y) 
=c— 2bt+ at? =4q(t), 


where c = (x,x) and a= (y, y). Thus q(t) is a quadratic polynomial in 
the real variable ¢ and g(t) > 0 for all ¢. This implies that the equation 
q(t) = 0 has at most one real solution ¢t. From the quadratic formula we 
find that the discriminant is not positive; that is, 0 > 4b? — 4ac. Hence 


0 >b*—ac= Ix, y)I? — (X,X)CYsY), 
proving the inequality. a 


The inequality in (1.4) will be referred to as the CBS inequality. 
1.5. Corollary. If ¢ - ,-) is a semi-inner product on & and \\x|| = (x, xy'/ 


for all x in &, then 


(a) ||x + yl] < [lxl] + Ibvll for x, y in &, 
(b) |jax|| = |a| ||x|| for ain F and x in &. 


If «+ ,+) is an inner product, then 
(c) ||x|| = O implies x = 0. 


PROOF. The proofs of (b) and (c) are left as an exercise. To see (a), note that 
for x and y in @, 


IIx + yl? = (x + y,x + y) 
= |lxll? + Cy, x) + (x5) + III? 
= |[x||? + 2Re(x, y) + Il’. 
By the CBS inequality, Re(x, y) < |(x, y>| < ||| ||y||. Hence, 
lx + yl? < [ell? + 2flell Ill + Ill? 


Z 
= (Ilx|] + Ill)”. 
The inequality now follows by taking square roots. be 
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If (-,-) is a semi-inner product on % and if x, y © %, then as was 
shown in the preceding proof, 

IIx + yll? = [xi]? + 2Re(x, y) + ly’. 
This identity is often called the polar identity. 

The quantity ||x|| = (x,x)'/* for an inner product ¢- ,-) is called the 
norm of x. If £= F4 (R4 or C%) and ({a,},{8,}) = X4_,a,86,, then the 
corresponding norm is ||{a, }|] = [22_,\a,|7]'7. 

The virtue of the norm on a vector space 4 is that d(x, y) = ||x — y|| 
defines a metric on % [by (1.5)] so that 2 becomes a metric space. In fact, 
d(x, y) = |lx — yll = Ix — 2) + (2 — yi < Ie — zi + Mz — yl = 
d(x,z)+d(z, y). The other properties of a metric follow similarly. If 
& = F@ and the norm is defined as above, this distance function is the usual 
Euclidean metric. 


1.6. Definition. A Hilbert space is a vector space # over F together with 
an inner product ¢ - , -) such that relative to the metric d(x, y) = ||x — y|| 
induced by the norm, # is a complete metric space. 


If #= L?() and <f,g) = f{fgdp, then the associated norm is |lf]| = 
[if]? du]'/?. It is a standard result of measure theory that L*(m) is a 
Hilbert space. It is also easy to see that F% is a Hilbert space. 


REMARK. The inner products defined on L?() and F% are the “ usual” ones. 
Whenever these spaces are discussed these are the inner products referred 
to. The same is true of the next space. 


1.7. Example. Let J be any set and let /*(/) denote the set of all functions 
x: I> F such that x(i) = 0 for all but a countable number of i and 
LY, < 71x(i)|? < oo. For x and y in /*(/) define 


(x, ¥) = Lx(i)y(i). 


Then /*(/) is a Hilbert space (Exercise 2). 

If J =N, /7(/) is usually denoted by /*. Note that if Q = the set of all 
subsets of J and for E in Q, p(E)= oo if E is infinite and p(£) = the 
cardinality of E if E is finite, then /7(/) and L?(J,Q, «) are equal. 

Recall that an absolutely continuous function on the unit interval [0, 1] 
has a derivative a.e. on [0, 1]. 


1.8. Example. Let = the collection of all absolutely continuous func- 
tions f: [0,1] > F such that f(0)=0 and f’e L*(0,1). If (f,g) = 
[of (g(t) at for f and g in #, then # is a Hilbert space (Exercise 3). 


Suppose % is a vector space with an inner product ( - ,-) and the norm 
is defined by the inner product. What happens if (2%, d) (d(x, y) = ||x — yll) 
is not complete? 
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1.9. Proposition. Jf % is a vector space and ¢ - ,-) is an inner product on 
4 and if # is the completion of & with respect to the metric induced by the 
norm on &, then there is an inner product (-,-), on # such that 
(X,Y) v= (X, Ye for x and y in 2 and the metric on # is induced by this 
inner product. That is, the completion of 2 is a Hilbert space. 


The preceding result says that an incomplete inner product space can be 
completed to a Hilbert space. It is also true that a Hilbert space over R can 
be imbedded in a complex Hilbert space (see Exercise 7). 

This section closes with an example of a Hilbert space from analytic 
function theory. 


1.10. Definition. If G is an open subset of the complex plane C, then 
L(G) denotes the collection of all analytic functions f: G > C€ such that 


ff vo + iy)|? dx dy < c. 


L(G) is called the Bergman space for G. 


Several alternatives for the integral with respect to two-dimensional 
Lebesgue measure will be used. In addition to [{,f(x + iy) dxdy we will 
also see 


[Js and [ faArea. 
G 


Note that L2(G) Cc L*(u), where p = Area|G, so that L2(G) has a 
natural inner product and norm from L?(,). 


1.11. Lemma. I/f f is analytic in a neighborhood of B(a;r), then 


[Here B(a;r) = {z: |z — a| <r} and B(a;r)= {z: |z-—a| <r}.] 


PROOF. By the mean value property, if0 <t<r, f(a)=(1/27){7 fla + 
te'®) d@. Hence 


(mr?) "ff 


f= (mr) fi} f° f(a +t te’) d@| dt 


(a;r) 
= (2/r?) f'f(a)de= f(a). 
1.12. Corollary. If f € L2(G), a © G, and 0 <r < dist(a, dG), then 


y(a)| < ells 
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PRooF. Since B(a;r) C G, the preceding lemma and the CBS inequality 


imply 
Foal 


i 2 
si, N 


ayr) 


f(a) = — 


1/2 172 
Sh8 


1 
<—sllfl.rv7. 
7r 


1.13. Proposition. L2(G) is a Hilbert space. 


Proor. If » = area measure on G, then L?(u) is a Hilbert space and 
L2(G) C L?*(). So it suffices to show that L2(G) is closed in L*(u). Let 
{ f,} be a sequence in L2(G) and let f € L*(u) such that {|f, — f|’ du > 0 
aS Nn —> 00. 

Suppose B(a;r) C€ G and let 0 < p < dist(B(a;r), dG). By the preced- 
ing corollary there is a constant C such that |f,(z) — f,,(z)| < Cllf, — Aull, 
for all n,m and for |z — a| < p. Thus { f,} is a uniformly Cauchy sequence 
on any closed disk in G. By standard results from analytic function theory 
(Montel’s Theorem or Morera’s Theorem, for example), there is an analytic 
function g on G such that f(z) > g(z) uniformly on compact subsets of 
G. But since {|f, — f\?du — 0, a result of Riesz implies there is a subse- 
quence {f, } such that f, (z) > f(z) ae. [uw]. Thus f= g ae. [u] and so 


feliv). 


EXERCISES 
1. Verify the statements made in Example 1.2. 
2. Verify that /*(/) (Example 1.7) is a Hilbert space. 
3. Show that the space # in Example 1.8 1s a Hilbert space. 
4 


. Describe the Hilbert spaces obtained by completing the space 2 in Example 1.2 
with respect to the norm defined by each of the inner products given there. 


5. (A variation on Example 1.8) Let n > 2 and let #= the collection of all 
functions f: [0,1] > F such that (a) f(0) = 0; (b) forl <k <n-1, f™ (1) 
exists for all ¢ in [0,1] and f‘*? is continuous on [0,1]; (c) f°"~) is absolutely 
continuous and f‘”) € L7(0,1). For f and g in #, define 


(Ag= bd [P(g (0) at. 
k=1°9 


Show that # is a Hilbert space. 
6. Let u be a semi-inner product on % and put V= {x © #: u(x, x) = 0}. 


(a) Show that / is a linear subspace of 2. 
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(b) Show that if 
(xt+WV,y+NM) =u(x,y) 


for all x + W and y+ in the quotient space 2//, then (-,-) is a 
well-defined inner product on 2/Y. 


7. Let # be a Hilbert space over R and show that there is a Hilbert space ¥ over 
C and a map U: #—> & such that (a) U is linear; (b) (Uh,,Uh,) = ¢h,,h,) 
for all h,,h, in #; (c) for any k in & there are unique h,,h, in # such that 
k = Uh, + iUh,. (# is called the complexification of #.) 


8. If G={z EC: 0 < |z| <1} show that every f in L2(G) has a removable 
singularity at z = 0. 


9. Which functions are in L*(C)? 


10. Let G be an open subset of C and show that if a € G, then { f € L7(G): 
f(a) = 0} is closed in L2(G). 


11. If {h,,} is a sequence in a Hilbert space # such that L,,||h,,|| < oo, then show 
that L°°_,h, converges in #%. 


§2. Orthogonality 


The greatest advantage of a Hilbert space is its underlying concept of 
orthogonality. 


2.1. Definition. If # is a Hilbert space and f,¢g © #, then f and g are 
orthogonal if <f, g) = 0. In symbols, f 1g. If A,B C#, then A 1 B if 
f £ g for every f in A and g in B. 


If o?= R?’, this is the correct concept. Two non-zero vectors in R? are 
orthogonal precisely when the angle between them is 7/2. 


2.2. The Pythagorean Theorem. /f f,, f,,...,f, are pairwise orthogonal 
vectors in F#, then 


I + a tee till? = WAN? + AIP + FAA? 
ProoF. If f, 1 f,, then 


I + All? = (A+ fof +f) = WAI + 2Re(f, A) + IIA? 


by the polar identity. Since f, L f,, this implies the result for n = 2. The 
remainder of the proof proceeds by induction and 1s left to the reader. iz] 


Note that if f 1 g, then f 1 —g, so |If — gl? = ||Al* + |[g/|°?. The next 
result is an easy consequence of the Pythagorean Theorem if f and g are 
orthogonal, but this assumption is not needed for its conclusion. 
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2.3. Paralilelogram Law. If # is a Hilbert space and f and g © #, then 


If + gil? + If — gil? = 2QUA? + Iigil’). 
PROOF. For any f and g in & the polar identity implies 


If + gil? = IAI? + 2Re(f,g) + lIgil’, 


If — gil? = IAI? — 2Re(f, g) + Ilgil. 
Now add. J 


The next property of a Hilbert space is truly pivotal. But first we need a 
geometric concept valid for any vector space over F. 


2.4. Definition. If 2 is any vector space over F and A C &, then A is a 
convex set if for any x and yin A andO <t<1l,wx+(1—ft)yeEa. 


Note that {tx + (1 — t)y: 0 < ¢ < 1} 1s the straight-line segment joining 
x and y. So a convex set is a set A such that if x and y € A, the entire line 
segment joining x and y is contained in A. 

If @ is a vector space, then any linear subspace in % is a convex set. A 
singleton set is convex. The intersection of any collection of convex sets is 
convex. If # is a Hilbert space, then every open ball B( f;r) = {g © #: 
If — gl| <r} 1s convex, as is every closed ball. 


2.5. Theorem. [f # is a Hilbert space, K is a closed convex nonempty 
subset of #, and h © #, then there is a unique point k, in K such that 


\|2 — ko|] = dist(h, K) = inf{|A — kl: k E K}. 
Proof. By considering K —h = {k —h: k © K } instead of K, it suffices 
to assume that h = 0. (Verify!) So we want to show that there is a unique 
vector k, in K such that 
[Koll = dist(0, K ) = inf{||A||: k @ K}. 
Let d= dist(0, K ). By definition, there is a sequence {k,,} in K such that 
||A,,|| — d. Now the Parallelogram Law implies that 


k, —k,,\l° k, +k,, 


Since K is convex, 3(k, + k,,) © K. Hence, ||3(k, + k,,)||° => d?. If e > 0, 
choose N such that for n > N, ||k,||?7 < d? + 4e*. By the equation above, if 
n,m > N, then 


2 


2 
< 4(2d? + 4e*) —d? = he’. 


2 
Thus, ||k, — k,,|| < ¢ for n,m > N and {k,,} is a Cauchy sequence. Since 
H#H is complete and K is closed, there is a k, in K such that ||k, — ko|| > 0. 
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Also for all k,, 
AS<||Koll = |Ko — k, + k,l 


Thus Koll =d. 
To prove that ky is unique, suppose hy © K such that ||h|| = d. By 
convexity, (ky + hy) € K. Hence, 


d <|[5(Ao + Koll < a(llAoll + IlKoll) < a. 
So ||5(49 + ko)|| = d. The Parallelogram Law implies 


ho + ko hyo — Ko 
vi 2 


Zz 2 


bd 


ie 


-a?—| 


hence h, = Ko. a 


If the convex set in the preceding theorem is in fact a closed linear 
subspace of #, more can be said. 


2.6. Theorem. /f 4 is a closed linear subspace of #, h © #, and f, is the 
unique element of 4 such that \\|h — f,|| = dist(h,Z), then h — f, 1 4%. 
Conversely, if fy © M such that h — f, 1 &, then \|h — fo|| = dist(h, Z). 


PROOF. Suppose fp © 4 and ||h — f,|| = dist(h, Z). If f eZ, then f, + f 


EM and so ||h — foll’ < lh — (fo + AI? = IA — fo) — A? = WA - fall’ 
— 2Re(h — fy, f) + |Ifl’. Thus 


2Re(h — fo, f) < IAI’ 


for any f in @. Fix f in .@ and substitute ref for f in the preceding 
inequality, where (h — fy, f) = re'®, r > 0. This yields 2 Re{ te re'®} < 
t*\|fll*, or 2tr < r°||f|. Letting > 0, we see that r = 0; that is, h — fy  f. 

For the converse, suppose fy € 4 such that h — f, 1 @. If fe, then 
h—-f,1tfo —f so that 


I|A =| = Ie — fo) +( fo — fil’ 
= \(4= fol? + |Lfo 29" 
> ||h — fol’. 
Thus ||h — fol] = dist(h, 7). a 
IfACH#, let A*={feEH#: fi g forall g in A}. It is easy to see that 
A~ is a closed linear subspace of #. 
Note that Theorem 2.6, together with the uniqueness statement in Theo- 
rem 2.5, shows that if 4 is a closed linear subspace of # and h € #, then 


there is a unique element f, in 4 such that h — f, © .@~* . Thus a function 
P: #°— M can be defined by Ph = fo. 
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2.7. Theorem. If 4 is a closed linear subspace of # andh © #, let Ph be 
the unique point in M such that h — Ph 1 M. Then 


(a) P is a linear transformation on #, 

(b) ||Ph|| < ||A|| for every h in #, 

(c) P* = P (here P* means the composition of P with itself ), 
(d) kerP = @~ and ran P = 4%. 


PROOF. Keep in mind that for every h in #, h — Ph © M~ and |\|h — Ph 
= dist(h, Z@). 


(a) Let h,,h, €= 4 and a,,a,<F. If fe, then ([a,h, + a,h,] - 
[a,Ph, + a,Phy], f) = a,(h, — Phy, f) + a (hy, — Phy, f) = 0. By 
the uniqueness statement of (2.6), P(ah, + a,h,) = a,Ph, + a,Phy. 

(b) If he #, then h=(h—- Ph)+ Ph, Phe M, and h—- Phe. 
Thus ||A\|? = || — Phj|? + ||PA\|? > ||PAII?. 

(c) If f €.4, then Pf = f. For any h in #, Ph €.M; hence P*h = P( Ph) 
= Ph. That is, P* = P. 

(d) If Ph = 0, then h=h—- Phe.m@*. Conversely, if h ©. 4~, then 0 is 
the unique vector in .@ such that h -0=h 14. Therefore Ph = 0. 
That ran P = “1s clear. a 


2.8. Definition. If 7 is a closed linear subspace of # and P is the linear 
map defined in the preceding theorem, then P is called the orthogonal 
projection of # onto 4. If we wish to show this dependence of P on 4, we 
will denote the orthogonal projection of # onto @ by Py. 


It also seems appropriate to introduce the notation 4 < # to signify 
that .@ is a closed linear subspace of #. We will use the term Jinear 
manifold to designate a linear subspace of # that is not necessarily closed. 
A linear subspace of # will always mean a closed linear subspace. 


2.9. Corollary. If 4 < #, then (M*)* =. 


ProoF. If J is used to designate the identity operator on # (viz., Ih = h) 
and P = P,, then J — P is the orthogonal projection of # onto @~ 
(Exercise 2). By part (d) of the preceding theorem, (.4 *)* = ker(J — P). 
But 0 = (J — P)h iff h = Ph. Thus (.@~+)*=ker(/ — P)= ran P =.4%. 

a 


2.10. Corollary. [fA C #, then (A+)~ is the closed linear span of A in #. 


The proof is left to the reader; see Exercise 4 for a discussion of the term 
“closed linear span.” 


2.11. Corollary. If Y is a linear manifold in 3, then Y is dense in # iff 
Y - = (0). 


PROOF. Exercise. 
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EXERCISES 


1. Let # be a Hilbert space and suppose f, g € # with ||/|| = ||g|| = 1. Show that 
j¢f + (1 — t)g|| < 1 for 0 < t < 1. What does this say about {h © #: ||h|| < 1}? 


2. If @<+# and P= P,, show that J — P is the orthogonal projection of # 
onto “*. 


3. If MH <H#, show that WO.M* = (0) and every h in # can be written as 
h =f +g where fe and geM* If M4+M*={(f,2): fEM, 2 EM*) 
and T: 4+ M+ — # is defined by T(f, g) = f + g, show that T is a linear 
bijection and a homeomorphism if 4+-.4+ is given the product topology. 
(This is usually phrased by stating that M and 4+ are topologically complemen- 
tary in #.) 


4. If A C #, let VA = the intersection of all closed linear subspaces of # that 
contain A. VA is called the closed linear span of A. Prove the following: 


(a) VA < # and VA is the smallest closed linear subspace of # that con- 
tains A. 
(b) VA = the closure of {L7_,a,f,: n=1, a, EF, f, € A}. 


5. Prove Corollary 2.10. 
6. Prove Corollary 2.11. 


§3. The Riesz Representation Theorem 


The title of this section is somewhat ambiguous as there are at least two 
Riesz Representation Theorems. There is one so-called theorem that repre- 
sents bounded linear functionals on the space of continuous functions on a 
compact Hausdorff space. That theorem will be discussed later in this book. 
The present section deals with the representation of certain linear function- 
als on Hilbert space. But first we have a few preliminaries to dispose of. 


3.1. Proposition. Let # be a Hilbert space and L: #-F a linear 
functional. The following statements are equivalent. 


(a) L is continuous. 

(b) L is continuous at 0. 

(c) L is continuous at some point. 

(d) There is a constant c > QO such that |L(h)| < c|\h|| for every hin #. 


PROOF. It is clear that (a) => (b) = (c) and (d) = (b). Let’s show that 
(c) = (a) and (b) = (d). 

(c) = (a): Suppose L is continuous at h, and h is any point in #. If 
h,7hin #, then h,-—h+h, > ho. By assumption, L(h,) = lim[L(A,, 
—h+h,)| = lim[L(h,) — L(A) + L(A,)] = lim L(A,) — L(h) + L(hy). 
Hence L(h) = lim L(h,). 
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(b) = (d): The definition of continuity at 0 implies that L~‘({a & F: 
|a| < 1}) contains an open ball about 0. So there is a 6 > 0 such that 
B(O; 6) c L-*‘({a € F: |a| < 1}). That is, ||A|| < 6 implies |L(A)| < 1. If h 
is an arbitrary element of # and e > 0, then ||8((|A|| + e)~h|| < 6. Hence 


oh 6 
1>{L 
Oioesd 


[al +e 
1 
IL(h)| < = (Wall + €). 


IL(A)|; 


thus, 


Letting « — 0 we see that (d) holds with c = 1/6. a 


3.2. Definition. A bounded linear functional L on # is a linear functional 
for which there is a constant c > 0 such that |L(h)| < c\|h|| for all h in #%. 
In light of the preceding proposition, a linear functional is bounded if and 
only if it is continuous. 


For a bounded linear functional L: #- F, define 
|L|| = sup{|L(A)|: lhl] < 1}. 
Note that by definition, |/L|| < 00; ||Z|| is called the norm of L. 


3.3. Proposition. Jf L is a linear functional, then 
WLI = sup{|L(A)|: Wall = 1) 
= sup{|L(A)|/||Al|: h € 2, h # O} 
=inf{c>0:|L(A)| <cllhl|, hin #3}. 
Also, |L(h)| < ||£|| \|A\| for every hin #. 


PRrooF. Let a = inf{c > 0: ||L(h)|] < cllA||, 4 in % }. It will be shown that 
||Z|| = a; the remaining equalities are left as an exercise. If « > 0, then the 
definition of ||Z|| shows that |L((\|A|| + e)~*h)| < ||L||. Hence |L(A)| < 
[LCA] + €). Letting « — 0 shows that |L(A)| < ||L]| ||A|| for all h. So the 
definition of a shows that a < ||L||. On the other hand, if |Z(/)| < eljhl| 
for all h, then ||L|| < c. Hence ||Z|| < a. i 


Fix an hh, in # and define L: #- F by L(h) = (A, hg). It is easy to 
see that L is linear. Also, the CBS inequality gives that |L(A)| = |¢h, ho)| 
< |All ||Aol]. So L is bounded and ||L|| < ||Aol|. In fact, L(Ag/|IAoll) = 
(ho/||Aoll, Ao) = |lAoll, So that ||L]| = ||A ||. The main result of this section 
provides a converse to these observations. 


3.4. The Riesz Representation Theorem. /f L: #—> F is a bounded linear 
functional, then there is a unique vector hy in # such that L(h) = ¢h, hy) 
for every hin #. Moreover, ||L\| = ||Aoll- 
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Proor. Let .4=kerL. Because L is continuous, .4 1s a closed linear 
subspace of #. Since we may assume that “4 # #, M~ # (0). Hence there 
is a vector f, in @~ such that L(f)) = 1. Nowif h € # and a = L(h), 
then L(h — af.) = L(h)-a=0; soh — L(h)f, © 4. Thus 


OS <h= L(h) fos fo» 


= (hy fo) — L(AYIfoll?- 


So if hy = |Ifoll~7fo, L(A) = (h, hy) for all h in #. 

If hg © # such that (h,h,)) = ¢(h, ho) for all h, then hy — ho 1 #. In 
particular, hy — ho 1 hy — ho and so ho = ho. The fact that ||Z]| = ||| 
was shown in the discussion preceding the theorem. a 


3.5. Corollary. If (X,Q,u) is a measure space and F: L?(w)—> F is a 
bounded linear functional, then there is a unique h, in L?() such that 


F(h) = [hhody 
for every h in L?(1). 


Of course the preceding corollary is a special case of the theorem on 
representing bounded linear functionals on L?(p), 1 < p < oo. But it is 
interesting to note that it is a consequence of the result for Hilbert space 
[and the result that L*() is a Hilbert space]. 


EXERCISES 
1. Prove Proposition 3.3. 


2. Let #= /°(N). If N>1 and L: #- F is defined by L({ a, }) = ay, find the 
vector hy in # such that L(h) = (h,h,) for every h in #. 


3. Let #= /7(N U {0}). (a) Show that if {«,} € /’, then the power series L%_ ya, 2” 
has radius of convergence > 1. (b) If |A| <1 and L: #- C is defined by 
L({a,, }) = L_oa,, A", find the vector hy in # such that L(h) = (h,h,) for 
every h in #. (c) What is the norm of the linear functional L defined in (b)? 


4. With the notation as in Exercise 3, define L: #— C by L({a,,}) = L°%_,na,A" |, 
where |A| < 1. Find a vector hy in # such that L(h) = (h,h,) for every h 
in #. 


5. Let # be the Hilbert space described in Example 1.8. If 0 < ¢ < 1, define L: 
H— F by L(h) = h(t). Show that L is a bounded linear functional, find ||L]|, 
and find the vector hy in # such that L(h) = (h,h,) for all h in #. 


6. Let #= L*(0,1) and let C“ be the set of all continuous functions on [0,1] that 
have a continuous derivative. Let ¢ € [0,1] and define L: C“) > F by L(h) = 
h’(t). Show that there is no bounded linear functional on # that agrees with L 
on C"), 
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§4. Orthonormal Sets of Vectors and Bases 


It will be shown in this section that, as in Euclidean space, each Hilbert 
space can be coordinatized. The vehicle for introducing the coordinates is 
an orthonormal basis. The corresponding vectors in F“% are the vectors 
{€,,@5,...,e,}, where e, is the d-tuple having a 1 in the kth place and 
zeros elsewhere. 


4.1. Definition. An orthonormal subset of a Hilbert space # is a subset & 
having the properties: (a) for e in @, |le|| = 1; (b) if e,,e, © & and e, # e,, 
then e, 1 e,. 

A basis for # 1s a maximal orthonormal set. 


Every vector space has a Hamel basis (a maximal linearly independent 
set). The term “basis” for a Hilbert space is defined as above and it relates 
to the inner product on #. For an infinite-dimensional Hilbert space, a 
basis is never a Hamel basis. This is not obvious, but the reader will be able 
to see this after understanding several facts about bases. 


4.2. Proposition. If & is an orthonormal set in #, then there is a basis for H 
that contains €. 


The proof of this proposition is a straightforward application of Zorn’s 
Lemma and is left to the reader. 


4.3. Example. Let = L2[0,27] and for n in Z define e, in # by 
e,(t) = (27) '/exp(int). Then {e,: n © Z} is an orthonormal set in #%. 
(Here L2[0, 27] is the space of complex-valued square integrable functions.) 


It is also true that the set in (4.3) is a basis, but this is best proved after a 
bit of theory. 


4.4. Example. If #= F¢ and for 1 < k < d, e, = the d-tuple with 1 in the 
kth place and zeros elsewhere, then {e,,...,e,} 18 a basis for #. 


4.5. Example. Let #= /*(/) as in Example 1.7. For each i in J define e, 
in # by e,(i) = 1 and e,( 7) = 0 for j # i. Then {e,;: i € J} is a basis. 


The proof of the next result is left as an exercise (see Exercise 5). It is very 
useful but the proof is not difficult. 


4.6. The Gram—Schmidt Orthogonalization Process. [f # is a Hilbert 
space and {h,: n © N} is a linearly independent subset of #, then there is 
an orthonormal set {e,: n © N)} such that for every n, the linear span of 
{e,,..-,e€,} equals the linear span of {hy,...,h,}. 
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Remember that VA is the closed linear span of A (Exercise 2.4). 


4.7. Proposition. Let {e,,...,e,} be an orthonormal set in # and let 
M=N{e,,...,e,}. If P is the orthogonal projection of # onto M, then 


Ph= > (he, ye, 
k=1 


for allh in #. 


Proor. Let Qh = Li_i(h,e,e,. If 1 <j <n, then (Qh,e,) = 
wah, ey) {C,,€;) = Ch,e;) since e, 1 e, for k # j. Thus (h — Qh,e,) 
= 0 for 1 <j <n. That is, h — Qh 1 @ for every h in #. Since Qh is 
clearly a vector in 4, Qh is the unique vector h, in @ such that 
h—-h, +4 (2.6). Hence Qh = Ph for every h in #. | 


4.8. Bessel’s Inequality. If {e,: n <= N} is an orthonormal set andh © #, 
then 


 lKh,e, 1? < |All. 
n=1 


PROOF. Let h, =h — L2_,¢h,e,)e,. Then h, 1 e, for1 < k <n (Why?). 
By the Pythagorean Theorem, 
2 


All? = [A II? + 


> (h, ene x 
k=1 


= |[A,\ + dD he, 
k=1 


n 
2 
a a) 
k=1 
Since 1 was arbitrary, the result is proved. | 


4.9. Corollary. If & is an orthonormal set in # andh © #, then ¢h,e) # 0 
for at most a countable number of vectors e in @. 


PROOF. For each n>1 let & ={e€@: |¢h,e)| => 1/n}. By Bessel’s 
Inequality, &, is finite. But U"_.@ = {e & @: ¢h,e,) # O}. iz 


4.10. Corollary. Jf & is an orthonormal set andh © #, then 
Li MKh,e)|? < {IAll’. 


e€E& 


This last corollary is just Bessel’s Inequality together with the fact (4.9) 
that at most a countable number of the terms in the sum differ from zero. 
Actually, the sum that appears in (4.10) can be given a better interpreta- 
tion—a mathematically precise one that will be useful later. The question is, 
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what is meant by L{h,: i€ 1} if h; © 4 and I is an infinite, possibly 
uncountable, set? Let F be the collection of all finite subsets of 7 and order 
F by inclusion, so ¥ becomes a directed set. For each F in #, define 


he= {hie FY}. 


Since this is a finite sum, fh, is a well-defined element of #7. Now {h,: 
FeF}isanetin #. 


4.11. Definition. With the notation above, the sum L{h;: i © I} converges 
if the net {h,: F © F } converges; the value of the sum is the limit of the 
net. 


If #= F, the definition above gives meaning to an uncountable sum of 
scalars. Now Corollary 4.10 can be given its precise meaning; namely, 
Y{|(h,e)|*: e € &} converges and the value < ||h||* (Exercise 9). 

If the set J in Definition 4.11 is countable, then this definition of 
convergent sum is not the usual one. That is, if {h,,} is a sequence in #, 
then the convergence of L{h,: n © N} is not equivalent to the convergence 
of £%°_,h,. The former concept of convergence is that defined in (4.11) while 
the latter means that the sequence {L7_,h, }?°_, converges. Even if #’= F, 
these concepts do not coincide (see Exercise 12). If, however, .{h,: n = N} 
converges, then °°_,h, converges (Exercise 10). Also see Exercise 11. 


4.12. Lemma. If & is an orthonormal set and h © #, then 


Vi{ih,evye:eE 6} 


converges in XH. 


Proor. By (4.9), there are vectors e,,e,,... in & such that {ee @: 
(he) # 0} = {e,,e5, ... }. We also know that L°_,|(h, e,,)|* < ||Al|* < 0. 
So if e>0, there is an N such that D°%,|¢h,e,)|° < e*. Let Fy= 
{e,,---,@y_,} and let #= all the finite subsets of &. For F in F define 
h, = XU{¢h,e\e: e€ F}. If F and G © F and both contain Fo, then 


Ae hl’ = Lif{lkkh,e)|*: e © (F\G) U(G\ F)} 


<s \(h,e,) | 
n=N 


2 
pe oe 

So {h,;: F © F } is a Cauchy net in #. Because # is complete, this net 

converges. In fact, it converges to L°°_,(h,e,,e,,- iw 


4.13. Theorem. If & is an orthonormal set in 3, then the following 
statements are equivalent. 


(a) & is a basis for #. 


(b) [fh € # andh 1 @, thenh = 0. 
(c) VE= #. 
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(d) [fh © H#H, thenh = L{ch,eve: eE é}. 
(e) Ifgandh © #, then 


(gh) = Li{(g.e){e,h): e€ 6}. 
(f) Ifh © #, then |\h\\? = U{|¢(h, e)|*: e € &} (Parseval’s Identity). 


PROOF. (a) = (b): Suppose h 1 @ and h#0; then @U{A/||A||} is an 
orthonormal set that properly contains @, contradicting maximality. 

(b) = (c): By Corollary 2.11, V6 = # if and only if & + = (0). 

(b) = (d): If hE #, then f=h— L{ch,eye: e & &} is a well-defined 
vector by Lemma 4.12. If e, € &, then (f,e,) = (h,e,) — L{ch,e) (e, e,): 
e&é&}= ¢h,e,) — (h,e,) = 0. That is, fe &~. Hence f = 0. (Is every- 
thing legitimate in that string of equalities? We don’t want any illegitimate 
equalities.) 

(d) = (e): This is left as an exercise for the reader. 

(e) = (f): Since ||A||? = (h, h), this is immediate. 

(f) = (a): If @ is not a basis, then there is a unit vector eg (||eo|| = 1) in 
H such that e, 1 & Hence, 0 = L{|(eo,e)|7: e € &}, contradicting (f). 

= 


Just as in finite-dimensional spaces, a basis in Hilbert space can be used 
to define a concept of dimension. For this purpose the next result is pivotal. 


4.14. Proposition. [f # is a Hilbert space, any two bases have the same 
cardinality. 


Proor. Let & and ¥ be two bases for # and put e = the cardinality of @, 
n = the cardinality of 4%. If e or 7 is finite, then e = y (Exercise 15). 
Suppose both e and 7 are infinite. For e in &, lett A, = {fEF: (e,f) # 
0}; so ¥, is countable. By (4.13b), each f in ¥ belongs to at least one set 
F,, e in &. That iss F=U{LF: ee &}. Hence y < €- 8, = «. Similarly, 
Ee < 7. a 


4.15. Definition. The dimension of a Hilbert space is the cardinality of a 
basis and is denoted by dim #. 


If (.X, d) is a metric space that is separable and { B, = B(x;; €;): i © I} 1s 
a collection of pairwise disjoint open balls in X, then J must be countable. 
Indeed, if D is a countable dense subset of X, B,M D #0 for each i in J. 
Thus there is a point x, in B,M D. So {x;: i € I} is a subset of D having 
the cardinality of J; thus J must be countable. 


4.16. Proposition. [f # is an infinite-dimensional Hilbert space, then # is 
separable if and only if dim #= & 9. 
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Proor. Let & be a basis for #. If e,,e, € &, then |le, — e,||? = |le,|I’ 
\|e5||> = 2. Hence { B(e;1/ v2): e€ é } is a collection of pairwise disjoint 
open balls in #. From the discussion preceding this proposition, the 
assumption that #% is separable implies @ is countable. The converse is an 
exercise. Bi 


EXERCISES 

. Verify the statements in Example 4.3. 

. Verify the statements in Example 4.4. 

. Verify the statements in Example 4.5. 

. Find an infinite orthonormal set in the Hilbert space of Example 1.8. 


nN BB WwW NO F 


. Using the notation of the Gram—Schmidt Orthogonalization Process, show that 
up to scalar multiple e, = h,/||A,|| and for n > 2, e, = ||A, — f,ll (A, —f,), 
where f, is the vector defined formally by 

(hy, Ay) 7s (Ay-15 1) (hy» hy) 
-1 : : : 
~ det[ (A, i 1 ‘| (hy, h, 1) a Chy-1> 8-1) Nysla) 
h, Lo Vis 9) 


n 


In the next three exercises, the reader is asked to apply the Gram—Schmidt 
Orthogonalization Process to a given sequence in a Hilbert space. A reference for 
this material is pp. 82-96 of Courant and Hilbert [1953]. 


6. If the sequence 1,x,x?,... is orthogonalized in L?(—1,1), the sequence 
e, (x) = [3(2n + 1)]'”*P, (x) is obtained, where 
ae n 
P(x) = a o(q) G2-0", 
The functions P(x) are called Legendre polynomials. 


7. If the sequence e7*/2, xe /?2, x2e7/?, 1 is orthogonalized in 


L*(— 00,00), the sequence e,(x) = 2"nWa 72H, (x)e*'/2 is obtained, 
where 
H,(x) = (-)"e"{f) ew. 
: dx 
The functions H,, are Hermite polynomials and satisfy H/(x) = 2nH,,_,(x). 


8. If the sequence e */*, xe */*, x*e” */?, ... is orthogonalized in L*(0, 00), the 


sequence e,(x) =e */7L,(x)/n! is obtained, where 


L,(x) = e(<] (xe). 
The functions L,, are called Laguerre polynomials. 
9. Prove Corollary 4.10 using Definition 4.11. 


10. If {h,,} is a sequence in Hilbert space and L{h,: n © N} converges to h 
(Definition 4.11), then lim,27_jh, = h. Show that the converse is false. 
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11. If {h,,} is a sequence in a Hilbert space and L?_,||h,,|| < 00, show that L{A,: 
n © N} converges in the sense of Definition 4.11. 


12. Let {a,} be a sequence in F and prove that the following statements are 
equivalent: (a) L{a,: n © N} converges in the sense of Definition 4.11. (b) If 7 
is any permutation of N, then L?_,a,,,,, converges (unconditional convergence). 
(C) Lr=ila,| < 0. 


13. Let & be an orthonormal subset of # and let W = V@. If P is the orthogonal 
projection of # onto 4, show that Ph =L{ch,e)e: e€@} for every h 
in #. 


14. Let A = Area measure on D and show that 1, z,z’, ... are orthogonal vectors 
in L*(A). Find ||z"||, 2 > 0. If e, = |lz"||7'z", n = 0, is {e),e,, ...} a basis 
for L7(A)? 


15. In the proof of (4.14), show that if either ¢ or 7 is finite, then ¢ = n. 


16. If # is an infinite-dimensional Hilbert space, show that no orthonormal basis 
for # is a Hamel basis. Show that a Hamel basis is uncountable. 


17. Let d>1 and let p be a regular Borel measure on R“. Show that L?() is 
separable. 


18. Suppose L?(X,2,p) is separable and { E,: i € I} is a collection of pairwise 
disjoint subsets of X, E, © 2, and 0 < p(E,) < oo for all i. Show that / is 
countable. Can you allow p(E;) = 00? 


19. If {h € #: ||h|| < 1} is compact, show that dim #< oo. 
20. What is the cardinality of a Hamel basis for /7? 


§5. Isomorphic Hilbert Spaces and the Fourier 
Transform for the Circle 


Every mathematical theory has its concept of isomorphism. In topology 
there is homeomorphism and homotopy equivalence; algebra calls them 
isomorphisms. The basic idea is to define a map which preserves the basic 
structure of the spaces in the category. 


5.1. Definition. If #7” and % are Hilbert spaces, an isomorphism between 
H and # is a linear surjection U: #- X such that 


(Uh, Ug) = (h, 8) 
for all h, g in #. In this case # and & are said to be isomorphic. 
It is easy to see that if U: #— & is an isomorphism, then so is U7!: 


AH — #H. Similar such arguments show that the concept of “isomorphic” is 
an equivalence relation on Hilbert spaces. It is also certain that this is the 
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correct equivalence relation since an inner product is the essential ingredient 
for a Hilbert space and isomorphic Hilbert spaces have the “same” inner 
product. One might object that completeness is another essential ingredient 
in the definition of a Hilbert space. So it is! However, this too is preserved 
by an isomorphism. An isometry between metric spaces is a map that 
preserves distance. 


5.2. Proposition. Jf V: #’— & is a linear map between Hilbert spaces, then 
V is an isometry if and only if (Vh,Vg) = ¢h, g) for all h, gin #. 


Proor. Assume (Vh,Vg) = ¢h,g) for all h,g in #. Then ||Vhl|* = 
(Vh,Vh) = ¢h,h) = ||h\|? and V is an isometry. 

Now assume that V is an isometry. If h,g<@# and AEF, then 
[2 + Ag||? = ||Vh + AVg||?. Using the polar identity on both sides of this 
equation gives 


Al]? + 2ZRerACh, g) + [AL Ilgi|? = || VAI? + 2ReACVh, Vg) + IAL? || Vell’. 
But ||VhA|| = ||A|| and ||Vgl| = ||g||, so this equation becomes 
RedA¢h, g) = ReA(Vh, Vg) 


for any in F. If F = R, take A = 1. If F = C, first take A = 1 and then 
take A =i to find that (h,g) and (Vh,Vg) have the same real and 
imaginary parts. =z 


Note that an isometry between metric spaces maps Cauchy sequences into 
Cauchy sequences. Thus an isomorphism also preserves completeness. That 
is, if an inner product space is isomorphic to a Hilbert space, then it must be 
complete. 


5.3. Example. Define S: /* > /* by S(a,, a), ...) = (0, a), @), ...). Then 
S is an isometry that is not surjective. 


The preceding example shows that isometries need not be isomorphisms. 

A word about terminology. Many call what we call an isomorphism a 
unitary operator. We shall define a unitary operator as a linear transforma- 
tion U: #— # that is a surjective isometry. That is, a unitary operator is 
an isomorphism whose range coincides with its domain. This may seem to 
be a minor distinction, and in many ways it is. But experience has taught me 
that there is some benefit in making such a distinction, or at least in being 
aware of it. 


5.4. Theorem. Two Hilbert spaces are isomorphic if and only if they have the 
same dimension. 


ProoF. If U: #— # is an isomorphism and @ is a basis for #, then it is 
easy to see that U@ = {Ue: e € &} isa basis for #. Hence, dim #= dim #. 
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Let # be a Hilbert space and let & be a basis for #. Consider the 
Hilbert space /7(&). If h € #7, define h: € > F by h(e)= (h,e). By 
Parseval’s Identity h € /°(€) and |A|| = ||A||. Define U: 22> 1°(&) by 
Uh =h. Thus U is linear and an isometry. It is easy to see that ranU 
contains all the functions f in /*(&) such that f(e) = 0 for all but a finite 
number of e; that is, ranU is dense. But U, being an isometry, must have 
closed range. Hence U: #- /?(@&) is an isomorphism. 

If & is a Hilbert space with a basis #, % is isomorphic to /7(F). If 
dim #= dim # , € and F have the same cardinality; it is easy to see that 
1?(&) and /?(F ) must be isomorphic. Therefore # and % are isomorphic. 

a 


5.5. Corollary. All separable infinite dimensional Hilbert spaces are isomor- 
phic. 


This section concludes with a rather important example of an isomor- 
phism, the Fourier transform on the circle. 

The proof of the next result can be found as an Exercise on p. 263 of 
Conway [1978]. Another proof will be given later in this book after the 
Stone—Weierstrass Theorem is proved. So the reader can choose to assume 
this for the moment. Let D = {z € C: |z| < 1}. 


5.6. Theorem. Jf f: 0D —C is a continuous function, then there is a 
sequence { p,(z,Z)} of polynomials in z and Zz such that p,(z,Zz) > f(z) 
uniformly on OD. 


Note that if z © 0D, 7 =z~'. Thus a polynomial in z and Z on 0D 
becomes a function of the form 


If we put z = e”’, this becomes a function of the form 


n 
y a,ei* 
k=-—m 
Such functions are called trigonometric polynomials. 
We can now show that the orthonormal set in Example 4.3 is a basis for 
Lé[0, 27]. This is a rather important result. 


5.7. Theorem. If for each n in Z, e,(t) = (27) '/exp(int), then {e,: 
n € Z} is a basis for Lé[0, 27]. 


PROOF. Let F= {(L7__,a,e,: a, © C, n => 0}. Then 7 is a subalgebra of 
C,[0, 27], the algebra of all continuous C-valued functions on [0, 277]. Note 
that if f<= 7, f(0) = f(27). We want to show that the uniform closure of 7 
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is @= {f © C, [0,27]: f(0) = f(27)}. To do this, let f © @ and define F: 
oD > C by F(e")= f(t). F is continuous. (Why?) By (5.6) there is a 
sequence of polynomials in z and 2, { p,(z,Zz)}, such that p,(z,z) > F(z) 
uniformly on dD. Thus p,(e",e°")— f(t) uniformly on [0,27]. But 
pleie “VET. 

Now the closure of @ in L2[0,27] is all of Lé[0, 27] (Exercise 6). Hence 
V{e,: n © Z} = Lé[0,27] and {e,} is thus a basis (4.13). & 


Actually, it is usually preferred to normalize the measure on [0, 27]. That 
is, replace dt by (27) ‘dt, so that the total measure of [0,27] is 1. Now 
define e,(¢) = exp(int), Hence {e,: ne Z} is a basis for = 
L2({0, 27], (27) * dt). If f € #, then 


A = = 1 27 Jit 
5.8 f(m) = (fen) = a2 [Flee ade 
is called the nth Fourier coefficient of f, n in Z. By (5.7) and (4.13d), 
5.9 f= Ss f(n)e,, 


where this infinite series converges to f in the metric defined by the norm of 
H. This is called the Fourier series of f. This terminology is classical and 
has been adopted for a general Hilbert space. 

If # is any Hilbert space and @ is a basis, the scalars {(h,e);e € &} 
are called the Fourier coefficients of h (relative to &) and the series in 
(4.13d) is called the Fourier expansion of h (relative to &). 

Note that Parseval’s Identity applied to (5.9) gives that °° l(a)? < 
oo. This proves a classical result. 


5.10. The Riemann-Lebesgue Lemma. Jf f € L’[0,27], then 
[o7f(tje i dt > 0 asn > +00. 


If f © L2{0, 27], then the Fourier series of f converges to f in L*-norm. 
It was conjectured by Lusin that the series converges to f almost every- 
where. This was proved in Carleson [1966]. Hunt [1967] showed that if 
f © Lé[0,27], 1 < p < oo, then the Fourier series also converges to f a.e. 
Long before that, Kolmogoroff had furnished an example of a function f in 
L;-[0, 27] whose Fourier series does not converge to f a.e. 

For f in L¢é[0,27], the function f: Z — C is called the Fourier transform 
of f; the map U: L2[0,27]— /°(Z) defined by Uf =f is the Fourier 
transform. The results obtained so far can be applied to this situation to 
yield the following. 


5.11. Theorem. The Fourier transform is a linear isometry from L2[0,27] 
onto 1*(Z). 
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Proor. Let U: L2[0,27] > /°(Z) be the Fourier transform. That U maps 
L? = L2[0,27] into /7(Z) and satisfies ||Uf|| = ||/| is a consequence of 
Parseval’s Identity. That U is linear is an exercise. If {a,} © /*(Z) and 
a,, = 0 for all but a finite number of n, then f= L°%__ Va,e, © L*. It is 


easy to check that f(n) = a, for all n, so Uf = {a,,}. Thus ranU is dense in 
/*, But U is an isometry, so ranU is closed; hence U is surjective. | 


Note that functions in Lé[0,27] can be defined on dD by letting 
f(e'’) = f(@). The ambiguity for 6 = 0 and 27 (or e’ = 1) might cause us 
to pause, but remember that elements of L2[0,27] are equivalence classes 
of functions—not really functions. Since {0,27} has zero measure, there is 
really no ambiguity. In this way Lé[0,27] can be identified with L2( dD), 
where the measure on 0D is normalized arc-length measure (normalized so 
that the total measure of 0D is 1). So L2[0,27] and L2( dD) are (naturally) 
isomorphic). Thus, Theorem 5.11 is a theorem about the Fourier transform 
of the circle. 

The importance of Theorem 5.11 is not the fact that L7[0,27] and /*(Z) 
are isomorphic, but that the Fourier transform is an isomorphism. The fact 
that these two spaces are isomorphic follows from the abstract result that all 
separable infinite dimensional Hilbert spaces are isomorphic (5.5). 


EXERCISES 
1. Verify the statements in Example 5.3. 


2. Define V: L*(0,00) > L?(0,00) by (Vf)(t) = f(t + 1). Show that V is an 
isometry that is not surjective. 


3. Define V: L*(R) > L?(R) by (Vf)(t) = f(t + 1) and show that V is an isomor- 
phism (a unitary operator). 


4. Let # be the Hilbert space of Example 1.8 and define U: #> L?(0,1) by 
Uf = f’. Show that U is an isomorphism and find a formula for U~!. 


5. Let (X, 2%, w) be a o-finite measure space and let u: X — F be an (2-measurable 
function such that sup{|u(x)|: x € X}< oo. Show that U: L*(X,2,p) > 
L*(X, Q, 4) defined by Uf = uf is an isometry if and only if |u(x)| = 1 ae. [p], 
in which case U is surjective. 


6. Let = {f © C[0,27]: f(0) = f(27)} and show that ¢ is dense in L7[0, 22]. 


7. Show that {(1/ V2), (1/ V7 )cos nt,(1/ V7 )sin nt: 1 <n < oo} is a basis for 
L*[-, 7]. 


8. Let (X,{2) be a measurable space and let y,v be two measures defined on 
(X,02). Suppose » <p and ¢ is the Radon—Nikodym derivative of v with 
respect to p (¢ = dv/d). Define V: L?(v) > L?(p) by Vf = Vof. Show that V 
is a well-defined linear isometry and V is an isomorphism if and only if p « v 
(that is, » and v are mutually absolutely continuous). 
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§6. The Direct Sum of Hilbert Spaces 


Suppose # and & are Hilbert spaces. We want to define #@® %& so that it 
becomes a Hilbert space. This is not a difficult assignment. For any vector 
spaces 2 and Y¥, #® Y is defined as the Cartesian product 2 x Y where 
the operations are defined on 2 X Y coordinatewise. That is, if elements of 
X® Y are defined as{x @y: x EB, ye Y}, then (x, ® y,) + (x2 ® Jp) 
= (x, + x,) ®(y, + y,), and so on. 


6.1. Definition. If # and % are Hilbert spaces, #@ H= {h@k: he #, 
k © X#} and 


Ch, ®k,,h, ® ky) = (hy, hy) + (ky, ko). 


It must be shown that this defines an inner product on #©@ # and that 
H-® H is complete (Exercise). 

Now what happens if we want to define #7, © #, ® --- for a sequence 
of Hilbert spaces #,, %,, ...? There is a problem about the completeness 
of this infinite direct sum, but this can be overcome as follows. 


6.2. Proposition. If #%,, %,,... are Hilbert spaces, let H= {(h,,)7_3: 
h, €H foralln and LV°_,\\|h,||\° < 0}. Forh = (h,) and g =(g,) in #, 
define 


6.3 ho) = Do Mas B a 
n=] 


Then <-,-) is an inner product on # and the norm relative to this inner 
product is \|h\| = [X%,||A,||?]'/*. With this inner product # is a Hilbert 
space. 


Proor. If h = (h,,) and g=(g,) & #, then the CBS inequality implies 
El (has Bnd < LM Wall < CMA gll2)'7Lligall2)'/2 < 00. Hence the series 
in (6.3) converges absolutely. The remainder of the proof is left to the 
reader. = 


6.4. Definition. If 7,, %, ... are Hilbert spaces, the space # of Proposi- 
tion 6.2 is called the direct sum of #,, #,, ... and is denoted by #= #, 
BH, O::-. 


This is part of a more general process. If {#%: i © I} is a collection of 
Hilbert spaces, #= ®{ #: i € I} is defined as the collection of functions 
h: I>U{#%: i€ 1} such that h(i)€% for all i and L{{\A(i)|\?: 
iE€l}<o.lfh,geH#, (h,g) =L{c(h(i), gi): 1 I}; 4 isa Hilbert 


space. 
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The main reason for considering direct sums is that they provide a way of 
manufacturing operators on Hilbert space. In fact, Hilbert space is a rather 
dull subject, except for the fact that there are numerous interesting ques- 
tions about the linear operators on them that are as yet unresolved. This 
subject is introduced in the next chapter. 


EXERCISES 


1. Let {(X;, Q;,u;): i © I} be a collection of measure spaces and define X, 22, and 
p as follows. Let X = the disjoint union of {X,;: ie J} and let Q= {AC X: 
An X,€Q, for all i}. For A in Q put p(A4)=2L,4,(A X,). Show that 
(X,Q,) is a measure space and L*(X, 2, 1) is isomorphic to ® { L?(X,, Q,, 1): 
ie}. 


2. Let (X, 2) be a measurable space, let j1,, 4, be measures defined on (X, §2), and 
put =p, + p,. Show that the map V: L*(X,2,") > L’?(X, Q,p,) ® 
L*(X,2,,) defined by Vf =f, © f,, where f; is the equivalence class of 
L7(X,Q,p ,) corresponding to f, is well defined, linear, and injective. Show that 
U is an isomorphism iff », and p, are mutually singular. 


CHAPTER II 


Operators on Hilbert Space 


A large area of current research interest is centered around the theory of 
operators on Hilbert space. Several other chapters in this book will be 
devoted to this topic. 

There is a marked contrast here between Hilbert spaces and the Banach 
spaces that are studied in the next chapter. Essentially all of the information 
about the geometry of Hilbert space is contained in the preceding chapter. 
The geometry of Banach space lies in darkness and has attracted the 
attention of many talented research mathematicians. However, the theory of 
linear operators (linear transformations) on a Banach space has very few 
general results, whereas Hilbert space operators have an elegant and well- 
developed general theory. Indeed, the reason for this dichotomy is related to 
the opposite status of the geometric considerations. Questions concerning 
operators on Hilbert space don’t necessitate or imply any geometric difficul- 
ties. 

In addition to the fundamentals of operators, this chapter will also 
present an interesting application to differential equations in Section 6. 


§1. Elementary Properties and Examples 


The proof of the next proposition is similar to that of Proposition I.3.1 and 
is left to the reader. 


1.1. Proposition. Let # and X be Hilbert spaces and A: #—> X a linear 
transformation. The following statements are equivalent. 


(a) A is continuous. 
(b) A is continuous at 0. 
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(c) A is continuous at some point. 
(d) There is a constant c > 0 such that \|Ah|| < c||h|| for allh in #. 


As in (1.3.3), if 
||A|| = sup{|Ahl|: h © #, ||hI| < 1}, 


then 


||Al] = sup{||AAl|: |||] = 1} 
sup {||Ah||/|lA||: 4 # 0} 


inf{c > 0: ||AA|| < cllh||, hin 7}. 


Also, ||AA|| < ||A]| ||A]|. ||A|| 1s called the norm of A and a linear transfor- 
mation with finite norm is called bounded. Let @(#’, X ) be the set of 
bounded linear transformations from # into #. For #= XH, B(H#, HH) 
= B( HH). Note that 4(#,F) = all the bounded linear functionals on #. 


1.2. Proposition. (a) If A and BE B(#H, Xx), thn A+ BEB H,2X), 
and \|A + B\| < ||Al] + ||BIl. 

(b) If aE F and AC B(H,HX), then aA © BH, HX) and |\aA|| = 
|| ||A]]. 

(c) [ffA€B(H,XH) andBE BX, L£), then BA ©€ BH, £) and |\BA|| 
< ||B|| |All. 


PROOF. Only (c) will be proved; the rest of the proof is left to the reader. If 
k &€X, then ||Bk|| < ||Bl| ||ki|. Hence, if he 7’, k = Ah © XH and so 
|BAh|| < || Bl] ||AAl] < [Bi All All. = 


By virtue of the preceding proposition, d(A, B) = ||A — B|| defines a 
metric on @(#, # ). So it makes sense to consider #(#, # ) as a metric 
space. This will not be examined closely until later in the book, but later in 
this chapter the idea of the convergence of a sequence of operators will be 
used. 


1.3. Example. If dim #= n < oo and dim #= m < oo, let {e,,...,e¢,} be 
an orthonormal basis for # and let {¢€,,...,¢,,} be an orthonormal basis 
for %. It can be shown that every linear transformation from # into % is 
bounded (Exercise 3). If 1 <j<n,1<is<m, let a,, = (Ae,,¢,). Then the 
m Xn matrix (a,;) represents A and every such matrix represents an 
element of @( #7, ”#). 


1.4. Example. Let /*? = /*(N) and let e,,e,,... be its usual basis. If 
A € @(I*), form a;; = (Ae,,e;). The infinite matrix (@;,) represents A as 
finite matrices represent operators on finite dimensional spaces. However, 
this representation has limited value unless the matrix has a special form. 
One difficulty is that it is unknown how to find the norm of A in terms of 
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the entries in the matrix. In fact, if 2 < n < oo, there is no known formula 
for the norm of a matrix in terms of its entries. A sufficient condition that 1s 
useful is known, however (see Exercise 11). 


1.5. Theorem. Let (X,02,u) be a o-finite measure space and put #= 
L*(X,Q, pp) = L*(p). If 6 € L®(), define M,: L*(p) > L*(w) by Myf = 
of. Then M, © B(L*(p)) and ||M,l| = |l4ll..- 


PROOF. Here ||@||,, is the p-essential supremum norm. That 1s, 


inf{sup{|o(x)|:x EN}: NE Q, p(N) = 0} 


Pll. 


= inf{c > 0: u({x © X: |o(x)| > c}) = O}. 


Thus ||¢||,, is the infimum of all c > 0 such that |@(x)| < c ae. [w] and, 
moreover, |¢(x)| < ||||,, a-e. [wu]. Thus we can, and do, assume that 9 is a 
bounded measurable function and |$(x)| < ||@||,, for all x. Soif f © L7(,), 
then /lof|? dp < |I$l.fIfl?du. That is, M, © B(L7(n)) and ||M,I| < 
llP||,.- If « > 0, the o-finiteness of the measure space implies that there is a 
set A in 92, 0 < p(A) < oo, such that |¢(x)| = ||¢||_,, — « on A. (Why?) If 
f= (u(A))'*x 4, then fe L*(w) and ||fIl,)= 1. So |MgI? = Ilo fll5 = 
(u(A))~"{alol* du = (loll. — €)*. Letting e > 0, we get that ||M,l| = [l¢ll.- 
a 


The operator M, is called a multiplication operator. The function ¢ is its 
symbol. 

If the measure space (X,{2,) is not o-finite, then the conclusion of 
Theorem 1.5 is not necessarily valid. Indeed, let (2 = the Borel subsets of 
[0, 1] and define p» on Q by p(A) = the Lebesgue measure of A if 0 € A and 
u(A) = oo if 0 € A. This measure has an infinite atom at 0 and, therefore, is 
not o-finite. Let @ = x9). Then @ € L®(u) and ||9$||,, = 1. If f © L*(w), 
then oo > {{f\? du = |f(0)|*u({0}). Hence every function in L?() vanishes 
at 0. Therefore M, = 0 and ||M,]|| < |l$l|..- 

There are more general measure spaces for which (1.5) is valid—the 
decomposable measure spaces (see Kelley [1966]). 


1.6. Theorem. Let (X, 92, 1) be a measure space and suppose k: X X X > F 
is an §2 X (2-measurable function for which there are constants c, and c, such 
that 


JkOoydu(y) sc. ae. [a], 


J ikOs vide) ser ae. [a], 
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If K: L*(p) > L?(p) is defined by 


(Kf )(x) = fk(x, y) f(y) du(y), 


then K is a bounded linear operator and ||K|| < (c,c,)'’°. 


Proor. Actually it must be shown that Kf © L?(), but this will follow 
from the argument that demonstrates the boundedness of K. If f € L?(n), 


Kf (x) < fik(x, » f(y) dey) 


= flex, yx, yA) du(y) 


lA 


fie dnc) fie(x. I OrPaw(s] © 


lA 


o¥| fik(x y) Vo)? day) 


Hence 


JIKF (x)? du(x) se f fik(% yy O)P ae) du) 


a of (r)P flk(x, y)| du(x) du(y) 


2 
< c,c,|//|*. 


Now this shows that the formula used to define Kf is finite a.e. [p], 
Kf © L*(u), and ||Kf|? < cc|/fl’. 


The operator described above is called an integral operator and the 
function k 1s called its kernel. There are conditions on the kernel other than 
the one in (1.6) that will imply that K is bounded. 

A particular example of an integral operator is the Volterra operator 
defined below. 


1.7. Example. Let k: [0,1] < [0,1] —~ R be the characteristic function of 
{(x, y): y < x}. The corresponding operator V: L7(0,1) > L7(0,1) defined 
by Vf(x) = fok(x, y) f(y) dy is called the Volterra operator. Note that 


V(x) = [ fo). 


Another example of an operator was defined in Example I.5.3. The 
nonsurjective isometry defined there is called the unilateral shift. It will be 
studied in more detail later in this book. Note that any isometry is a 
bounded operator with norm 1. 
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EXERCISES 
1. Prove Proposition 1.1. 


2. Prove Proposition 1.2. 


eS) 


. Suppose {e,} is an orthonormal basis for # and A: #-— # is a linear 
transformation such that )/||Ae,|| < oo. Show that A is bounded. 


4. Proposition 1.2 says that d(A, B) = ||A — B|| is a metric on B(#, XH). Show 
that (4%, X ) is complete relative to this metric. 


5. Show that a multiplication operator M, (1.5) satisfies M; = M, if and only if ¢ 
is a characteristic function. 


6. Let (X,2,) be a measure space and let k,,k, be two kernels satisfying the 
hypothesis of (1.6). Define 


k: XX X—>F_ byk(x,y) = [k,(,2)k,(z,y) du(z). 


(a) Show that k also satisfies the hypothesis of (1.6). (b) If K, K,, K, are the 
integral operators with kernels k,k,,k,, show that K = K,K,. What does this 
remind you of? Is more going on than an analogy? 


7. If (X,2,) is a measure space and k € L?(p X p), show that k defines a 
bounded integral operator. 


8. Let {e, } be the usual basis for /* and let {a, } be a sequence of scalars. Show 
that there is a bounded operator A on /* such that Ae, = a, e,, for all n if and 


only if { a, } is uniformly bounded, in which case ||A|| = sup{|a,,|: 1 > 1}. This 
type of operator is called a diagonal operator or is said to be diagonalizable. 


9. (Schur test) Let {a;,}/°;., be an infinite matrix such that a,; > 0 for all 1, / 


and such that there are scalars p; > 0 and B, y > 0 with 


ioe) 
2 a;;P; S Bp;. 
i=1 


oO 
» Q;;P; = YP; 
j=l 
for all i, j > 1. Show that there is an operator A on /7(N) with (Ae,,e,) = 4;; 
and ||A||? < By. 


10. (Hilbert matrix) Show that (Ae,,e,) = (i+ j + 1)~* for 0 <i, j < oo defines 
a bounded operator on /*(N U {0}) with ||A|| < 7. (See also Choi (1983].) 


11. Find the operator norm of a 2 X 2 matrix in terms of its entries. 


12. (Direct sum of operators) Let {3%} be a collection of Hilbert spaces and let 
H= ©, #,. Suppose A; © B(%,) for all i. Show that there is a bounded 
operator A on # such that A|#% = A, for all i if and only if sup,||A,|| < 00. In 
this case, ||A|| = sup,||A,|l. 
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§2. The Adjoint of an Operator 


2.1. Definition. If 4” and % are Hilbert spaces, a function u: #x #— F 
is a sesquilinear form if for h,g in #, k, f in #&, and a, B inF, 


(a) u(ah + Bg,k) = au(h,k) + Bu(g, k); 
(b) u(h,ak + Bf) = au(h,k) + Buh, f). 


The prefix “sesqui” is used because the function is linear in one variable 
but (for F =C) only conjugate linear in the other. (“Sesqui’” means 
““one-and-a-half.’’) 

A sesquilinear form is bounded if there is a constant M such that 
ju(h,k)| < M||Al| \|k|| for all h in 4% and k in &. The constant M is 
called a bound for u. 

Sesquilinear forms are used to study operators. If A € 4(#, &), then 
u(h,k) = (Ah, k) is a bounded sesquilinear form. Also, if B € @(#, #), 
u(h, k) = ¢h, Bk) is a bounded sesquilinear form. Are there any more? Are 
these two forms related? 


2.2. Theorem. Jf u: 0X X#- F is a bounded sesquilinear form with bound 
M, then there are unique operators A in B(H#, X# ) and B in B( XH, FH) such 
that 


2.3 u(h,k) = (Ah,k) = ¢h, Bk) 
for allh in # and k in X and \\A\l, ||B\| < M. 


PROOF. Only the existence of A will be shown. For each h in #, define L,: 
H-> F by L,(k) = u(h,k). Then L, is linear and |L,(k)| < M|jh|| ||Ki]. 
By the Riesz Representation Theorem there is a unique vector f in # such 
that (k, f) = L,(k) = u(h,k) and ||f|| < MI|A||. Let Ah = f. It is left as 
an exercise to show that A 1s linear (use the uniqueness part of the Riesz 
Theorem). Also, (Ah, k) = ¢k, Ah) = ¢k, f) = u(h,k). 

If A, © B(#, H#) and u(h,k) = (A,h,k), then (Ah — A,h,k) = 0 for 
all k; thus Ah — A,h = 0 for all h. Thus, A is unique. az 


2.4. Definition. If A <= A(#, X#), then the unique operator B in 
B(H, A) satisfying (2.3) is called the adjoint of A and is denoted by 
B= A*., 

The adjoint of an operator will usually be used for operators in #(#), 
rather than 4( 4, # ). There is one notable exception. 


2.5. Proposition. Jf U © B(#, X ), then U is an isomorphism if and only if 
U is invertible and U"} = U*. 


PROOF. Exercise. 
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From now on we will examine and prove results for the adjoint of 
operators in &(4#). Often, as in the next proposition, there are analogous 
results for the adjoint of operators in 4(#, X ). This simplification is 
justified, however, by the cleaner statements that result. Also, the interested 
reader will have no trouble formulating the more general statement when it 
is needed. 


2.6. Proposition. Jf A, B € @(#) and a © F, then: 


(a) (aA + B)* = adA* + B*. 

(b) (AB)* = B*A*. 

(c) A** = (A*)* = A. 

(d) If A is invertible in @(3/) and A” is its inverse, then A* is invertible 
and (A*)~} = (A74)*. 


The proof of the preceding proposition is left as an exercise, but a word 
about part (d) might be helpful. The hypothesis that A is invertible in 
B(#) means that there is an operator A~* in @(#) such that AA“! = 
A~‘A =I. It is a remarkable fact that if A is only assumed to be bijective, 
then A is invertible in @(#). This is a consequence of the Open Mapping 
Theorem, which will be proved later. 


2.7. Proposition. If A = B( #7), ||A|| = ||A*|| = ||A*A]]'7. 


Proor. For h in #, |jA|| <1, ||AA||? = (Ah, Ah) = (A*AA, AY < 
\|A*AA|| |||] < ||A*Al| < |]A*| |All. Hence ||Al|? < ||A*4]] < ||A*|] |All. 
Using the two ends of this string of inequalities gives ||A|| < ||A*|| when 
||A|| is cancelled. But A = A** and so if A* is substituted for A, we get 
\|A*|| < ||A**|| = ||A]]. Hence ||A|]| = ||A*||. Thus the string of inequalities 
becomes a string of equalities and the proof is complete. eal 


2.8. Example. Let (X, S2, 1) be a o-finite measure space and let M, be the 
multiplication operator with symbol ¢ (1.5). Then M;* is Mg, the multipli- 
cation operator with symbol 9¢. 


If an operator on F% is represented by a matrix, then its adjoint is 
represented by the conjugate transpose of the matrix. 


2.9. Example. If K is the integral operator with kernel k as in (1.6), then 
K * is the integral operator with kernel k*(x, y) = k(y, x). 


2.10. Proposition. Jf S: /* > I? is defined by S(a,,a,, ...) = 
(0, a,,a5, ...), then S is an isometry and S*(a,,Q5, ...) = (@7,@3, ...). 


PROOF. It has already been mentioned that S is an isometry (1.5.3). For (a,,) 


and (B,) in 1°, (S*(a,),(B,)) = (aq, S(B,)) = (1, 2, ---), (0, By, Bo. 
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...)) = a,B, + 038, + --- = ((a5,03, ...),(B,, By, ...)). Since this holds 
for every (B,,), the result is proved. a 


The operator S$ in (2.10) is called the unilateral shift and the operator S* 
is called the backward shift. 

The operation of taking the adjoint of an operator is, as the reader may 
have seen from the examples above, analogous to taking the conjugate of a 
complex number. It is good to keep the analogy in mind, but do not become 
too religious about it. 


2.11. Definition. If A © 4(#%), then: (a) A is hermitian or self-adjoint if 
A* = A; (b) A is normal if AA* = A*A. 


In the analogy between the adjoint and the complex conjugate, hermitian 
operators become the analogues of real numbers and, by (2.5), unitaries are 
the analogues of complex numbers of modulus 1. Normal operators, as we 
shall see, are the true analogues of complex numbers. Notice that hermitian 
and unitary operators are normal. 

In light of (2.8), every multiplication operator M, is normal; M, is 
hermitian if and only if ¢ is real-valued; M, is unitary if and only if 
|| = 1 ae. [uw]. By (2.9), an integral operator K with kernel k is hermitian 
if and only if k(x, y)=k(y,x) ae. [wu X pw]. The unilateral shift is not 
normal (Exercise 6). 


2.12. Proposition. If # is a C-Hilbert space and A © @(#), then A is 
hermitian if and only if (Ah,h) © R forallhin #. 


PRroor. If A = A*, then (Ah,h) = ¢h, Ah) = (Ah,h); hence (Ah,h) € 
R. 

For the converse, assume ¢ Ah, h) is real for every h in #. If a © C and 
h,g eH, then (A(h + ag), h + ag) = (Ah,h) + a Ah, g) + a(Ag, h) 
+ |a|*( Ag, g) © R. So this expression equals its complex conjugate. Using 
the fact that (Ah, h) and (Ag, g) © R yields 


a(Ag,h) + a(Ah, g) = ach, Ag) + aig, Ah) 
ac A*h, g) + a(A*g,h). 
By first taking a = 1 and then a = i, we obtain the two equations 
(Ag, h) + (Ah, g) = (A*h, g) + (A%g,h), 
i(Ag,h) — i( Ah, g) = —i(A*h, g) + i(A*g, hy. 
A little arithmetic implies (Ag, h) = (A*g,h), so A = A*. ia 


I 


The preceding proposition is false if it is only assumed that #% is an 


0 ‘ on R*, then (Ah, h) = 0 


R-Hilbert space. For example, if A = 1 0 
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for all h in R*. However, A* is the transpose of A and so A* # A. Indeed, 
for any operator A on an R-Hilbert space, (Ah, g) © R. 
2.13. Proposition. Jf A = A*, then 

||A]| = sup{|( Ah, A)|: [Al] = 1}. 


PRooF. Put M = sup{|( Ah, h)|: ||Al| = 1}. Lf ||A|| = 1, then |( AA, h)| < 
||A ||; hence M < ||A||. On the other hand, if ||A|| = ||g|| = 1, then 


(A(h + g),h +8) = (Ah,h) + (Ah, g) + (Ag,h) + (AB, 8) 
=<(Aligh)s <AN, 8) aX Be ATh) CAG, 8). 
Since A = A*, this implies 
(A(h+g¢),h +2) =(Ah,h) + 2Re(Ah, g) + (Ag, 8). 
Subtracting one of these two equations from the other gives 
4Re(Ah, g) = (A(h + g),h + g) — (ACh — 8), 4 - 8). 


Now it is easy to verify that |(Af, f)| < M\|fl? for any f in #. Hence 
using the parallelogram law we get 


4Re(Ah, g) < M(||h + gil? + |lA — gil’) 
= 2M((\lA\l? + Iigil) 
= 4M 


since h and g are unit vectors. Now suppose (Ah, g) = e™|( Ah, g)|. 
Replacing A in the inequality above with e~'"h gives |(Ah, g)| < M if 
\|4|| = ||g|| = 1. Taking the supremum over all g gives ||Ah|| < M when 
“|| = 1. Thus ||4j| <M. oi 


2.14. Corollary. Jf A = A* and (Ah,h) = 0 for allh, then A = 0. 


The preceding corollary is not true unless A = A*, as the example given 
after Proposition 2.12 shows. However, if a complex Hilbert space is 
present, this hypothesis can be deleted. 


2.15. Proposition. Jf # is a C-Hilbert space and A © BH) such that 
(Ah, h) = 0 for allh in #, then A = 0. 


The proof of (2.15) is left to the reader. 

If # is a C-Hilbert space and A € @(#), then B = (A + A*)/2 and 
C = (A — A*)/2i are self-adjoint and A = B + iC. The operators B and 
C are called, respectively, the real and imaginary parts of A. 
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2.16. Proposition. [f A € @(#), the following statements are equivalent. 


(a) A is normal. 
(b) ||AA|| = ||A*A]| for all h. 


If # is a C-Hilbert space, then these statements are also equivalent to: 
(c) The real and imaginary parts of A commute. 


Proor. If he #, then ||Ah||* — ||A*h||? = (AA, Ah) — (A*h, A*h) = 
((A*A — AA*)h, h). Since A*A — AA* is hermitian, the equivalence of (a) 
and (b) follows from Corollary 2.14. 

If B,C are the real and imaginary parts of A, then a calculation yields 


A*A = B* — iCB + iBC + C’, 

AA* = B? + iCB — iBC + C’. 
Hence A*A = AA* if and only if CB = BC, and so (a) and (c) are 
equivalent. i 


2.17. Proposition. If A < &(#), the following statements are equivalent. 


(a) A is an isometry. 
(b) A*A = J. 
(c) (Ah, Ag) = ¢h, g) forallh, gin #. 


PROOF. The proof that (a) and (c) are equivalent was seen in Proposition 
1.5.2. Note that if h, g © #, then (A*Ah, g) = (Ah, Ag). Hence (b) and 
(c) are easily seen to be equivalent. aw 


2.18. Proposition. [f A < @(#), then the following statements are equiv- 
alent. 


(a) A is unitary. 
(b) A is a surjective isometry. 
(c) A is a normal isometry. 


PROOF. (a) = (b): Proposition I.5.2. 
(b) = (c): By (2.17), A*A = J. But it is easy to see that the fact that A is 
a surjective isometry implies that A~! is also. Hence by (2.17) J = 
(A71)*47! = (A*)7 147! = (AA*) 7}; this implies that A*4 = AA* = J. 
(c) = (a): By (2.17), A*A = I. Since A 1s also normal, AA* = A*4 = 1 
and so A 1s surjective. | 


We conclude with a very important, though easily proved, result. 


2.19. Theorem. If A € @(#), then ker A = (ran A*)~. 


ProoF. If h € ker A and g € #, then ¢h, A*g) = (Ah, g) = 0, sokerA € 
(ran A*)*. On the other hand, if h 1 ran A* and g € #, then (Ah, g) = 
(h, A*g) = 0; so(ranA*)*CkerdA. 5 
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Two facts should be noted. Since A** = A, it also holds that ker A* = 
(ran A)~. Second, it is not true that (kerA)+ = ran A* since ran A* may 
not be closed. All that can be said is that (kerA)*= cl(ran A*) and 
(ker A*) * = cl(ran A). 


EXERCISES 
1. Prove Proposition 2.5. 
2. Prove Proposition 2.6. 
3. Verify the statement in Example 2.8. 
4. Verify the statement in Example 2.9. 
5. Find the adjoint of a diagonal operator (Exercise 1.8). 


6. Let S be the unilateral shift and compute SS* and S*S. Also compute S”"S*" 
and $*"S”. 


7. Compute the adjoint of the Volterra operator V (1.7) and V + V*. What is 
ran(V + V*)? 


8. Where was the hypothesis that # is a Hilbert space over C used in the proof of 
Proposition 2.12? 


9. Suppose A = B + iC, where B and C are hermitian and prove that B = (A + 
A*®)/2, C =(A — A*)/2i. 


10. Prove Proposition 2.15. 


11. If A and B are self-adjoint, show that AB is self-adjoint if and only if 
AB = BA. 


12. Let L°°_45a,,2z” be a power series with radius of convergence R, 0 < R < oo. If 
A € &(#) and ||A|| < R, show that there is an operator T in @(4%) such that 
for any h,g in #, (Th, g) = U%_ )a,(A"h, g). [If f(z) = La, z", the operator 
T is usually denoted by f(A).] 


13. Let A and T be as in Exercise 12 and show that ||T — ©?_,)a,A*|| > 0 as 
n— oo. If BA = AB, show that BT = TB. 


14. If f(z) = expz = L°_)z"/n! and A is hermitian, show that f(iA) is unitary. 


15. If A is a normal operator on #, show that A is injective if and only if A has 
dense range. Give an example of an operator B such that ker B = (0) but ran B 
is not dense. Give an example of an operator C such that C is surjective but 
kerC # (0). 


16. Let M, be a multiplication operator (1.5) and show that ker M, = (0) if and 
only if w({x: (x) = 0}) = 0. Give necessary and sufficient conditions on ¢ 
that ran M, is closed. 
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§3. Projections and Idempotents; Invariant and 
Reducing Subspaces 


3.1. Definition. An idempotent on # is a bounded linear operator E on # 
such that E? = E. A projection is an idempotent P such that ker P = 
(ran P)+. 


If @< #, then P, is a projection (Theorem [.2.7). It is not difficult to 
construct an idempotent that is not a projection (Exercise 1). 

Let E be any idempotent and set 4 = ran E and W= ker E. Since E is 
continuous, / is a closed subspace of #7. Notice that (J — E)* =I —2E 
+ E*=]—2E+E=I1-E; thus I] - E is also an idempotent. Also, 
0O= (1 — E)h=h — Eh, if and only if Eh =h. So ran E D ker(/ — E). 
On the other hand, if A € ranE, h = Eg and so Eh = E’g = Eg =h; 
hence ran EF = ker(/ — E). Similarly, ran(/ — EF) = ker £. These facts are 
recorded here. 


3.2. Proposition. (a) E is an idempotent if and only if I — E is an idempo- 
tent. (b) ran E = ker(J — E), kerE =ran(J — E), and both ran E and 
ker E are closed linear subspaces of #. (c) If M= ran E and N= kerE, 
then MN = (0) and MH+N=H. 


The proof of part (c) is left as an exercise. There is also a converse to (c). 
If 4,NV <4, MAN= (0), and 4+ V=#, then there is an idempo- 
tent E such that M= ran E and W= kerE; moreover, FE is unique. The 
difficult part in proving this converse is to show that E is bounded. The 
same fact is true in more generality (for Banach spaces) and so this proof 
will be postponed. 

Now we turn our attention to projections, which are peculiar to Hilbert 
space. 


3.3. Proposition. Jf E is an idempotent on # and E #0, the following 
statements are equivalent. 


(a) E is a projection. 

(b) E is the orthogonal projection of # onto ran E. 
(c) ||E|| = 1. 

(d) E is hermitian. 

(e) E is normal. 

(f) <Eh,h) = 0 for allh in #. 


PROOF. (a) = (b): Let #4 = ran E and P = Py. If h © #, Ph = the unique 
vector in .# such that h — Phe .@+=(ranE)*=kerE by (a). But 
h— Eh=(I-— E)h € kerE. Hence Eh = Ph by uniqueness. 

(b) = (c): By (1.2.7), ||E|| < 1. But Eh =h for h in ran E, so ||E|| = 1. 
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(c) = (a): Let h © (kerE)~. Now ran(J — E)=kerE, so h- Ehe 
ker E. Hence 0 = (h — Eh,h) = |h\\* — (Eh, h). Hence |jh||* = (Eh, h) 
< ||Eh|| ||A|| < ||Al|*. So for h in (ker E)+, ||Eh|| = ||Al] = (EA, hy'”*. But 
then for hf in (kerE)~, 


A — Eh||? = |\h||? — 2Re(Eh, h) + ||Eh||? = 0. 


That is, (kerE)* C ker(J — E) = ran E. On the other hand, if g € ran E, 
g=g, + g,, where g, € kerE and g, € (kerE)~. Thus g = Eg = Eg, = 
g,; that is, ran E C (kerE)*. Therefore ran E = (kerE)~ and E is a 
projection. 

(b) > (f): If he, write h=h,+h,, h, eG ranE, h,€& kerk = 
(ran E)~. Hence (Eh, h) = (E(h, + hy), hy + hy) = (Eh, hy) = 
(hy, hy) = |\hyIP? = 0. 

(f) => (a): Let h, © ranE and h, © kerE. Then by (f), 0 < (E(h, + 
h,),h, +h) = (hy, hy + (hy, hy). Hence —||A, ||? < (h,, 45) for all h, 
in ran E and h, in ker E. If there are such h, and h, with (h,,h,) =a # 0, 
then substituting k, = —2a7"||h,||?A, for h, in this inequality, we obtain 
—||h,||? < —2||h,||°, a contradiction. Hence (h,,h,) = 0 whenever h, € 
ran EF and h, © kerE. That is, E is a projection. 

(a) = (d): Let h,g © # and put h=h, +h, and g=g, + g,, where 
h,,g, © ran E and h,, g, © kerE = (ran E)* . Hence (Eh, g) = (hy, g,). 
Also, (E*h, g) = ¢h, Eg) = (hy, g,;) = (Eh, g). Thus E = E*. 

(d) = (e): clear. 

(e) = (a): By (2.16), ||Eh|| = ||£*A|| for every h. Hence ker E = kerE™*. 
But by (2.19), kerE* = (ran E)~ , so E is a projection. i 


Note that by part (b) of the preceding proposition, if E is a projection 
and @= ran FE, then FE = Py. 

Let P be a projection with ran P = 4 and kerP = YW. So both 4 and 
MN are closed subspaces of # and, hence, are also Hilbert spaces. As in 
(1.6.1), we can form WOW. If U: HON H 1s defined by U(h © g) 
=h-+g for h in 4 and g in JW, then it is easy to see that U is an 
isomorphism. Making this identification, we will often write 7= 7 @® SN. 

More generally, the following will be used. 


3.4. Definition. If {7} is a collection of pairwise orthogonal subspaces of 
H , then 
@,M,=NVN;,M,. 
If @ and / are two closed linear subspaces of #, then 
MOEN=ZMAN*. 
This is called the orthogonal difference of M@ and WN. 
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Note that if 4,V<# and W1 WNW, then 4+ VW is closed. (Why?) 
Hence “4SWV= 4+ MN. The same is true, of course, for any finite 
collection of pairwise orthogonal subspaces but not for infinite collections. 


3.5. Definition. If 4 © @(%) and 4 < #, say that @ is an invariant 
subspace for A if Ah © M@ whenever h € 4. In other words, if AZ CZ. 
Say that 4 is a reducing subspace for A if AMC and AM*CM~. 


If 42 < #, then #= 10 M- If A € BH), then A can be written as 
a 2 X 2 matrix with operator entries, 


| lw x 
3.6 a=|¥ a 


where W © B(.4), X © B(M+,M), Y © B(A,M*), and Z € BM"). 


3.7. Proposition. [fA © B(#), 4 < #H, and P = Py, then statements (a) 
through (c) are equivalent. 


(a) 4 is invariant for A. 
(b) PAP = AP. 
(c) In (3.6), Y = 0. 


Also, statements (d) through (g) are equivalent. 


(d) 4 reduces A. 

(e) PA = AP. 

(f) In (3.6), Y and X are 0. 

(g) 4 is invariant for both A and A*. 


PROOF. (a) > (b): If hE #, Ph EM. So APh © M. Hence, P(APh) = 
APh. That is, PAP = AP. 

(b) = (c): If P is represented as a 2 X 2 operator matrix relative to 
KH= M ® M~ , then 


Hence, 


So Y = 0. 
(c) = (a): If Y = 0 and h €.Z, then 
_|W X\{h\ _ | Wh 
os i Ale | 0 Jem. 
(d) = (e): Since both 4 and .4* are invariant for A, (b) implies that 
AP = PAP and A(I — P)=(I— P)A(U/ — P). Multiplying this second 
equation gives A — AP = A — AP — PA + PAP. Thus PA = PAP = AP. 
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(e) = (f): Exercise. 
(f) > (g): If X = Y = 0, then 


_|W 0 x _|W* 0 
A he a and A | , o) 
By (c), -@ is invariant for both A and A*. 

(g) = (d): If he M~ and g €.%, then (g, Ah) = (A*g,h) = 0 since 
A*g © .M. Since g was an arbitrary vector in &%, AheEM~. That is, 
AM*CM”. a 


If @ reduces A, then X = Y = O in (3.6). This says that a study of A is 
reduced to the study of the smaller operators W and Z. This is the reason 
for the terminology. 

If A © @(#) and -@ is an invariant subspace for A, then A|.Z@ is used 
to denote the restriction of A to “4. That is, A|.W is the operator on .4 
defined by (A|.@)h = Ah whenever h € .4@. Note that A|Z € @(.@) and 
||A|.4|| < ||A||. Also, if 7 is invariant for A and A has the representation 
(3.6) with Y = 0, then W = A|Z. 


EXERCISES 


1. Let # be the two-dimensional real Hilbert space R*, let 4 = {(x,0) € R*: 
x € R} and let Y= {(x, x tan@): x © R}, where 0 < 6 < 47. Find a formula 
for the idempotent FE, with ran FE, =.@ and ker E, = W. Show that ||£6|| = 
(sind)~?. 


2. Prove Proposition 3.2 (c). 


3. Let {4,: i@ I} be a collection of closed subspaces of “ and show that 
UM? : i EI} =MEM: ie T})* and (NG: iE T\)7=V{MeP 21 ET}. 


4. Let P and Q be projections. Show: (a) P + Q is a projection if and only if 
ran P | ranQ. If P + Q isa projection, then ran(P + Q) = ran P + ranQ and 
ker(P + Q) = kerP ¢ kerQ. (b) PQ is a projection if and only if PQ = QP. If 
PQ is a projection, then ran PQ = ranPMranQ and ker PQ = cl(kerP + 
kerQ). 


5. Generalize Exercise 4 as follows. Suppose {.4;: i€ J} is a collection of 
subspaces of # such that W, 14, if i#j. Let P, be the projection of # 
onto “, and show that for all h in #, 2. {P.h: i © I} converges to Ph, where 
P is the projection of # onto V{.4@;: i € T}. 


6. If P and Q are projections, then the following statements are equivalent. (a) 
P — Q is a projection. (b) ranQ C ran P. (c) PO = Q. (d) OP =Q. If P-—@Q 
is a projection, then ran(P — Q) = (ran P) © (ranQ) and ker(P — Q) = ranQ 
+ ker P. 


7. Let P and Q be projections. Show that PQ = QP if and only if P + QO— PQ 
is a projection. If this is the case, then ran(P + Q — PQ) = cl(ran P + ranQ) 
and ker(P + Q — PQ) = kerP / kerQ. 
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8. Give an example of two noncommuting projections. 


9. Let A € B(H#) and let W= graph A CHO H#H. That is, W= {h @® Ah: 
h € #}\. Because A is continuous and linear, VW < #0 #. Let “= #@ (0) < 
HO H. Prove the following statements. (a) 4 W= (0) if and only if ker A = 
(0). (b) #+ W is dense in #6 # if and only if ranA is dense in #. (c) 
M+ N=HO8 XH if and only if A is surjective. 


10. Find two closed linear subspaces 4, W of an infinite-dimensional Hilbert space 
H# such that 4A WV= (0) and 4+ W is dense in #, but WZ+ VF #. 


11. Define A: [*(Z) > °(Z) by A(..., a 1, &, 0), ...) =(...,& 4,09, %, --.), 
where ~ sits above the coefficient in the 0-place. Find an invariant subspace of A 
that does not reduce A. 


12. Let » = Area measure on D = {z € C: |z| < 1} and define A: L?(p) > L?(p) 
by (Af)(z) = 2f(z) for |z| <1 and f in L*(z). Find a nontrivial reducing 
subspace for A and an invariant subspace that does not reduce A. 


§4. Compact Operators 


It turns out that most of the statements about linear transformations on 
finite-dimensional spaces have nice generalizations to a certain class of 
operators on infinite-dimensional spaces—namely, to the compact oper- 
ators. 


4.1. Definition. A linear transformation T: #- 4 is compact if T(ball 
H ) has compact closure in %. The set of compact operators from #% into 
XH is denoted by #)(#, £), and B,(#) = B,(#, #). 


4.2. Proposition. (a) @)(4#, #) C B(#, X#). 

(b) Bo( #, HX ) is a linear space and if {T,,} C @(H, K) and T € &(H, K) 
such that ||T,, — T|| > 0, then T © @)(H#, #). 

(c) [fAEB(#), BEB XL), andT€ BH, HX), then TA and BT € 
B (HH, Xx). 


PROOF. (a) If T © B(#, XH), then cl[T(ball #)] is compact in #. Hence 
there is a constant C > 0 such that T(ball #) C {k © #: ||k|| < C}. Thus 
7] < C. 

(b) It is left to the reader to show that 4)(#, #) is a linear space. For 
the second part of (b), it will be shown that T(ball #7) is totally bounded. 
Since # is a complete metric space, this is equivalent to showing that 
T(ball #) has compact closure. Let e > 0 and choose n such that ||T — T,|| 
< e/3. Since T, is compact, there are vectors h,,...,h,, in ball # such 
that T,(ball 4) c U"_, B(T,h,; €/3). So if ||h|| < 1, there is an h, with 
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|T,2, — T,h\| < ¢/3. Thus 
|| Th; — Thl| < |\Th,; -— TA, + 2,4; — Thal + (tA — Tall 


< 2|T — T,l| + €/3 
< €E. 


Hence 7(ball #) C U"_, B(Th,; €). 
The proof of (c) is left to the reader. a 


4.3. Definition. An operator T on # has finite rank if ranT is finite 
dimensional. The set of finite-rank operators is denoted by By (#, X); 
Bo H) = Bol H, #). 


It is easy to see that By( 4, H) is a linear space and By (H, #) C 
B (HH, X) (Exercise 2). Before giving other examples of compact oper- 
ators, however, the next result should be proved. 


4.4. Theorem. Jf T € &(#, X), the following statements are equivalent. 


(a) T is compact. 

(b) 7* is compact. 

(c) There is a sequence {T,,\ of operators of finite rank such that ||T — T,,| 
=e), 


PROOF. (c) = (a): This is immediate from (4.2b) and the fact that 
Boo H, H) © By H, H). 

(a) = (c): Since cl[T(ball #)] is compact, it is separable. Therefore 
cl(ranT) = £ is a separable subspace of %. Let {e,,e,, ... } be a basis for 
f and let P, be the orthogonal projection of # onto V(e;: 1 <j <n}. Put 
T, = P,,T; note that each T, has finite rank. It will be shown that ||7,, — 7) 
— 0, but first we prove the following: 


Claim. If h € #, ||T,h — Th\| > 0. 


In fact, k = Th € &, so ||P,k — k|| > 0 by (1.4.13d) and (1.4.7). That is, 
\|P,,7h — Th\| — 0 and the claim is proved. 

Since T is compact, if e« > 0, there are vectors h,,...,h,, in ball such 
that T(ball #) C UL, B(Th;; &/3). So if ||h|| < 1, choose h, with || Th — 
Th || < ¢/3. Thus for any integer n, 


|| Th — T,hl| < \|Th — Thj| + \|Th; — T,h,|| + P,(Th; — Th) 
< 2\|Th — Th,|| + ||Th, — T,h,\| 
20/3 >P th, Tf ||: 
Using the claim we can find an integer ny such that ||7h, — 7,,h,|| < e/3 for 


1<j<m and n=np). So ||Th — T,h\| < e uniformly for h in ball #. 
Therefore ||T — T,|| < e for n = no. 
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(c) = (b): If {7,,} is a sequence in By (H’, XH) such that ||7, — 7|| — 0, 
then ||7* — T*|| = ||7, — T\| ~ 0. But T)* © Bo( 4, #) (Exercise 3). 
Since (c) implies (a), 7 * is compact. 

(b) = (a): Exercise. a 


A fact emerged in the proof that (a) implies (c) in the preceding theorem 
that is worth recording. 


4.5. Corollary. [f T © &(#, #), then cl(ranT) is separable and if {e,,} 
is a basis for c\(ranT) and P,, is the projection of X onto Vie; 1 <j <n}, 
then ||P,T — T|| — 0. 


4.6. Proposition. Let # be a separable Hilbert space with basis {e,,\; let 
{a,} CF with M = sup{|a,|: 2 = 1} < o. If Ae, =a,e, for alln, then A 
extends by linearity to a bounded operator on # with ||A|| = M. The operator 
A is compact if and only if a, > 0 asn— oo. 


PROOF. The fact that A is bounded and ||A|| = M is an exercise; such an 
operator is said to be diagonalizable (see Exercise 1.8). Let P, be the 
projection of # onto V{e,,...,e,}. Then A, = A — AP, is seen to be 
diagonalizable with A,e, = a,e, if j > and A,e,= 0 if j <n. So AP, € 
Boo ( #) and ||A,|| = sup{la,|: 7 > n}. If a, > 0, then ||A,|| > 0 and so A 
is compact since it is the limit of a sequence of finite-rank operators. 
Conversely, if A is compact, then Corollary 4.5 implies ||A,|| — 0; hence 
a, > 0. a 


4.7. Proposition. If (X, Q, pu.) is a measure space and k © L*(X x X,Q xX 
Q2, 2 X pt), then 
(Kf )(x) = fk(x, y) f(y) dn(y) 


is a compact operator and ||K\| < ||k]||>. 
The following lemma is useful for proving this proposition. 
4.8. Lemma. If {e,: i € I} is a basis for L*(X,Q,w) and 


,;(x, y) = e,(x)e(y) 
for i, j in I and x, y in X, then {,,: i, j € I} is a basis for L*(X x X,Q x 
(2, X p). 
PROOF. Since [fl;\° de dp. = lle,ll7lle,{I? = 1, 9; © L*(p Xp). If (i, 7) # 
(a, 8), then 
(dap>iz) = ff balx)e(r) 60 )4)(») d(x) daly) 


= bq» bi) (>be) = 0. 


So {¢,,} is an orthonormal family. 
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If ¢ € L?(u X p), then the fact that [{|o(x, y)|* du(x) du(y) < oo im- 
plies that {|(x, y)|? du(x) < oo for almost all y in X. That is, if (x) = 


o(x, y), then o,€ L*(p) for almost all y. Thus f(y) = (e;,9,) = 
[o(x, y)e,(x) du(x) is well defined. Moreover, f, € L’(m) (Exercise). But 


Wi = Eke; AP = E| FOV (0) dev) | 


2 


= Lf foe, veCae,() dal) dey) 


7s L$, 6). 


So if ¢ 1 ¢,, for all i, j, then f; = 0 for all i. Thus ¢, = 0 in L*(p) for 
almost all y. That is, ¢ = 0. Therefore, { 9; ;} is a basis. ny 


PROOF OF PROPOSITION 4.7. Let {e;} and {¢;;} be as in Lemma 4.8. Since 
ke L*(p X p), 


WIZ = LUCK, OI? 
ij 


2 


pe 
iJ 


J fkCx, v)eCJe;(y) du(x) du(y) 


2 


-r 
i,J 

a Lil(Ke,, e;)|?. 
i,j 


Vie y)ej(y) du(y) e,(3) du(x) 


Thus, if f = Ljaje, € L7(w), L|a,|* < 00, then 
2 


| ff, e)|° = 


dia; Ke;, e; 
j 


< [Zias?)[ LiKe, al 
Therefore, 


KAI? = LICKf.e:)1? 


< AISIAL’. 
This shows that K is bounded and ||K|| < ||k||,. 

Now assume that k is a linear combination of a finite number of the 
{$,,}. It is left to the reader to show that in this case K has finite rank. If k 
is an arbitrary element of L?( X pw), then & is in the linear span of a 
countable number of 9,,. Say that k= LP 21% mPam> Pnm(X V) = 
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Cn(X)Cm(Y)- If Ky = uM 1p. mn. m» then ||ky — k||) > 0 as N > oo. If 
K,, is the integral operator corresponding to k,, K, has finite rank and 
Ky — K\| < ||ky — &l|, ~ 0. Thus K 1s compact. a 


In particular, note that the preceding proposition shows that the Volterra 
operator (1.7) 1s compact. 

One of the dominant tools in the study of linear transformations on 
finite-dimensional spaces is the concept of eigenvalue. 


4.9. Definition. If 4 © @(#), a scalar a 1s an eigenvalue of A if ker(A — 
a) # (0). If A is a nonzero vector in ker(A — a), h is called an eigenvector 
for a; thus Ah = ah. Let o,(A) denote the set of eigenvalues of A. 


4.10. Example. Let A be the diagonalizable operator in Proposition 4.6. 
Then 6,(A) = (a), a), ...}. If a © o,(A), let J, = {7 © N: a, = a}. Then 
h is an eigenvector for a if and only if h € V{e;; j © J,}. 


4.11. Example. The Volterra operator has no eigenvalues. 


4.12. Example. Let h © #= Léi(—17,7) and define K: #># by 
(Kf (x) = [% h(x — y) f(y) dy. TEX, = Qa?) h(xexp(—inx) dx = 
h(n), the nth Fourier coefficient of h, then Ke, =\,e,, where e,(x) = 
(2a) 1/*exp(inx). 


The way to see this is to extend functions in Lé(—7,7) to R by 
periodicity and perform a change of variables in the formula for (Ke, )(x). 
The details are left to the reader. 

Operators on finite-dimensional spaces always have eigenvalues. As the 
Volterra operator illustrates, the analogy between operators on finite-dimen- 
sional spaces and compact operators breaks down here. If, however, a 
compact operator has an eigenvalue, several nice things can be said if the 
eigenvalue is not zero. 


4.13. Proposition. If T © B)(#), \ © o,(T), and  # 0, then the eigen- 
space ker(T — 2) is finite dimensional. 


PROOF. Suppose there is an infinite orthonormal sequence {e, } in ker(T — 
A). Since T is compact, there is a subsequence {e, } such that {Te,, } 
converges. Thus, {Te,, } is a Cauchy sequence. But for n, #n,, ||Te,, — 
Te,, ||" = ||Ae,, — Ae, I? = 2|A|* > 0 since A # 0. This contradiction shows 
that ker(T — A) must be finite dimensional. a 


The next result on the existence of eigenvalues is not a practical way to 
show that a specific example has a nonzero eigenvalue, but it is a good 
theoretical tool that will be used later in this book (in particular, in the next 
section). 
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4.14. Proposition. If T is a compact operator on #,  # 0, and inf{||(T — 
AJA: ||Al| = 1} = 0, thenX € a (T). 


PrRooF. By hypothesis, there is a sequence of unit vectors {h,,} such that 
(7 — A)h,|| > 0. Since T is compact, there is a vector f in # and a 
subsequence {h,, } such that ||Th, — fl] > 0. But h, =A~"[(A— T)h,, 
td.) aN f° So 1 = |jA- ‘Al = “IAL LPAI and es 0. Also, it must be 
that Th, > A~'Tf. Since Th, > f, f=-'Tf, or Tf = Af. That is, f € 
ker(T — "h) and f #0, so 4 €,(T). ics 


4.15. Corollary. [f T is a compact operator on #, X #0, A €a,(T), and 
A€o (1 *), then ran(T — A) = # and (T — A)! is a hounded operator 
on #. 


PROOF. Since A ¢ o,(T), the preceding proposition implies that there is a 
constant c > 0 such that ||(T — A)hA|| > cllAl| for all h in #. If f © clran(T 
— X), then there is a sequence {h,} in # such that (T — A)h, — f. Thus 
2, —A,,|| < ¢ (7 — A)A, — (T— A)A,,|| and so {h,} is a Cauchy se- 
quence. Hence h, — h for some h in #. Thus (T — A)h = f. So ran(T — 2d) 
is closed and, by (2.19), ran(T — A) = [ker(T — A)*]~ = #, by hypothesis. 

So for f in # let Af = the unique vector h such that (JT — A)h = f. 
Thus (T — A) Af = f for all f in #. From the inequality above, c||A/f|| < 
(7 — A)Af]| = IIA. So ||Afl] < c7{I/f| and A is bounded. Also, (T — 
A\)JA(T —A)h =(T-—-A)h, sco 0=(T—A)A(T —A)h —h]. Since A ¢€ 
0,(T), A(T —A)h=h. Thatis, A=(T—X)*. 


It will be proved in a later chapter that if A € 0,(7) and A # 0, then 
A¢€o (T *). 

More will be shown about arbitrary compact operators in Chapter VI. In 
the next section the theory of compact self-adjoint operators will be 
explored. 


EXERCISES 
1. Prove Proposition 4.2(c). 
2. Show that every operator of finite rank is compact. 


3. If TE By(#,#), show that T* ©€ Bo(#,#) and dim(ranT) = 
dim(ran T* ). 


4. Show that an idempotent is compact if and only if it has finite rank. 
5. Show that no nonzero multiplication operator on L*(0,1) is compact. 


6. Show that if T: #— 4 is a compact operator and {e,,} 1s any orthonormal 
sequence in #, then ||Te,|| > 0. Is the converse true? 
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7. If T is compact and -@ is an invariant subspace for 7, show that 7|.@ is 
compact. 


8. If h,g € #, define T: #— # by Tf = (f,h)g. Show that T has rank 1 [that 
is, dim(ran 7) = 1]. Moreover, every rank 1 operator can be so represented. 
Show that if 7 is a finite-rank operator, then there are orthonormal vectors 
€),...,@, and vectors g),...,g, such that Th = L7_,(h,e,)g, for all A in #. 
In this case show that T is normal if and only if g, = A,e, for some scalars 
A,,.-.,A,,. Find 6,(T). 


9. Show that a diagonalizable operator is normal. 
10. Verify the statements in Example 4.10. 
11. Verify the statement in Example 4.11. 


12. Verify the statement in Example 4.12. (Note that the operator K in this example 
is diagonalizable.) 


13. If T, € @(%,), n = 1, with sup, ||T7,|| < co and T= ©” JT, on H= O%_ H 


n= |" nn? 


show that T is compact if and only if each 7; is compact and |{7; || > 0. 


§5*. The Diagonalization of Compact 
Self-Adjoint Operators 


This section and the remaining ones in this chapter may be omitted if the 
reader intends to continue through to the end of this book, as the material 
in these sections (save for Section 6) will be obtained in greater generality in 
Chapter IX. It is worthwhile, however, to examine this material even if 
Chapter IX is to be read, since the intuition provided by this special case is 
valuable. 

The main result of this section is the following. 


3.1. Theorem. If T is a compact self-adjoint operator on #, {h,,d ,...} 
are the distinct nonzero eigenvalues of T, and P,, is the projection of # onto 
ker(T — 4,,), then PP, = P,,P,, = 0 ifn # m, each X, is real, and 


5.2 T=) Ap? 


where the series converges to T in the metric defined by the norm of B(#). 
[Of course, (5.2) may be only a finite sum.]| 


The proof of Theorem 5.1 requires a few preliminary results. Before 
beginning this process, let’s look at a few consequences. 
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5.3. Corollary. With the notation of (5.1): 


(a) ker T = [V{P,#: n> 1}]*+=(ranT)-; 
(b) each P,, has finite rank; 
(c) ||7|| = sup{]A,|: 2 => 1} and A, 7-0 asn- o. 


Proor. Since P, 1 P,, for n # m, if h © #, then (5.2) implies ||Th||? = 
Le IA, 2A? = D2, /A, ||P, 21/7; Hence Th = 0 if and only if P,A = 0 for 
all n. That is, h € kerT if and only if h 1 P,# for all n, whence (a). 

Part (b) follows by Proposition 4.13. 

For part (c), if #=cl[ran7], Y is invariant for T. Since T = 7%, 
L= (kerT)* and £ reduces T. So we can consider the restriction of T to 
L, TIL. Now $= V{P,H: n = 1} by (a). Let fe: 1 <j < N,} bea 
basis for P= ker(T — ,,), so Te” = d,e\” for 1 <j < N,. Thus {e}”: 
1<j<N,, n= 1} 18a basis for # and 7|/£ is diagonalizable with respect 
to this basis. Part (c) now follows by (4.6). a 


The proof of (c) in the preceding corollary revealed an interesting fact 
that deserves a statement of its own. 


5.4. Corollary. Jf T is a compact self-adjoint operator, then there is a 
sequence {,,} of real numbers and an orthonormal basis {e,} for (ker T)~ 
such that for all h, 


Th a » HSA, ene n- 


n=1 


Note that there may be repetitions in the sequence {p,,} in (5.4). How 
many repetitions? 


5.5. Corollary. If T€ @(#), T=T*, and kerT = (0), then # is 


separable. 


Also note that by (4.6), if (5.2) holds, T © #)(#). 
To begin the proof of Theorem 5.1, we prove a few results about not 
necessarily compact operators. 


5.6. Proposition. Jf A is a normal operator and X © F, then ker(A — A) = 
ker(A — A)* and ker(A — X) is a reducing subspace for A. 


PROOF. Since A is normal, so is A — A. Hence ||(A — A)A|| = ||(A — A)*Al| 
(2.16). Thus ker(A — A) = ker(A — A)*. If h © ker(A — A), Ah=AhE 
ker(A — A). Also A*h = Ah € ker(A — A). Therefore ker(A — A) reduces 
A, S 


5.7. Proposition. [f A is a normal operator and X, p. are distinct eigenvalues 
of A, then ker(A — A) 1 ker(A — pz). 
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PrRoor. If 4 € ker(A — A) and g€ ker(A — p), then the fact (5.6) that 
A*g = pg implies that Ach, g) = (Ah, g) = (h, A*g) = (h, ig) = 
uch, g). Thus (A — p)(h, g) = 0. Since A — p #0, A 1 g. eS 


5.8. Proposition. Jf A = A* and \ € o,(A), then X is a real number. 


Proor. If Ah = AA, then Ah = A*h = Nh by (5.6). So (A — A)h = O. Since 
h can be chosen different from 0, A = X. a 


The main result prior to entering the proof of Theorem 5.1 is to show that 
a compact self-adjoint operator has nonzero eigenvalues. If (5.3c) is ex- 
amined, we see that there is a A, in o,(7) with |A,| = ||7||. Since the 
preceding proposition says that A, © R, it must be that A, = +||7||. That 
is, either + ||7|| © o,(7’). This is the key to showing that o,(7T) is nonvoid. 


5.9. Lemma. If T is a compact self-adjoint operator, then either +||T|| is an 
eigenvalue of T. 


PROOF. If T = 0, the result is clear. So suppose T # 0. By Proposition 2.13 
there is a sequence {h,,} of unit vectors such that |(7h,,h,)| > ||T||. By 
passing to a subsequence if necessary, we may assume that (7h,,h,) > X, 
where |A| = ||7||. It will be show that A € o,(T). Since |A| = ||T||, 0 < 
I(T — AJA, | = ITA, ||? — 2X¢Th,, h,) + < 2 — 2NTh,,h,) > 0. 
Hence ||(T — A)h,|| > 0. By (4.14), A € a (T). Z 


PROOF OF THEOREM 5.1. By Lemma 5.9 there is a real number A, in 0,(T) 
with |A,| = ||Z]|. Let &, = ker(T — A,), P, = the projection onto &,, #%, = 
é&> . By (5.6) &, reduces T, so #, reduces T. Let T, = T|%,; then T, is a 
self-adjoint compact operator on #,. (Why?) 

By (5.9) there is an eigenvalue A, for 7, such that |A,| = ||T7,||. Let 
6, = ker(T, — A). It is easy to check that &, = ker(T — X,) andso)d, # A,. 
Let P, = the projection of # onto &, and put #, =(&, ® €,)+. Note 
that ||73|| < ||Z|| so that |A,| < |A,|. 

Using induction (give the details) we obtain a sequence {A,,} of real 
eigenvalues of 7 such that 


(1) JA, | = |A2| eee 
(ii) If & = ker(T ~A,,), Ansil = ITM, @ ++ OE,“ II. 


By (1) there is a nonnegative number a such that |A,| — a. 
Claim. a = 0; that is, imA, = 0. 


In fact, let e, € &,, |le,|| = 1. Since T is compact, there is an h in # and 
a subsequence {e,, } such that ||Te, — A|| > 0. But e, 1 e,, for n # m and 
J 
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Te, =, e,. Hence ||Te, — Te, ||? =X, +, = 2a°. Since {Te, } is a 
Cauchy sequence, a=0.— 

Now put P, = the projection of # onto &, and examine T — L"_,A,P.. 
If heé,, 1<k <n, then (T— L%_,A,P)h = Th—A,h =0. Hence &, 
®--. 6, Cker(T — Li_jA,P). If he (é, @--- 6€,)~, then Ph =0 
for 1 <j <n; so (T— L*_,A,P,)h = Th. These two statements, together 
with the fact that (@, ® --- ®&,)~ reduces T, imply that 


= ||T|\(6, @ --- @&,) "|| 


DD hP, 
j=l 


= Lvwer > 0. 
Therefore the series U°°_,A,,P, converges in the metric of @(#) to T. - 


Theorem 5.1 is called the Spectral Theorem for compact self-adjoint 
operators. Using it, one can answer virtually every question about compact 
hermitian operators, as will be seen before the end of this chapter. 

If in Theorem 5.1 it is assumed that T is normal and compact, then the 
same conclusion, except for the statement that each A, is real, is true 
provided that # is a C-Hilbert space. The proof of this will be given in 
Section 7. 


EXERCISES 
1. Prove Corollary 5.4. 
2. Prove Corollary 5.5 


3. Let K and k be as in Proposition 4.7 and suppose that k(x, y) = k(y, x). Show 
that K is self-adjoint and if {u,} are the eigenvalues of K, each repeated 
dim( K — p,,) times, then D3 |p, |* < oo. 


4. If T is acompact self-adjoint operator and {e,,} and {p,,} are as in (5.4) and if h 
is a given vector in #, show that there is a vector f in # such that 7f =h if 
and only if h 1 kerT and Ly, 7|¢h,e,,)|? < oo. Find the form of the general 
vector f such that Tf = h. 


5. Let T, {y,,}, and {e,} be as in (5.4). If A #0 and A # yp, for any p,,, then for 
every h in # there is a unique f in # such that (T — A)f = h. Moreover, 
f=A'[h + UA, (A —A,,) Kh, e,)e,]. Interpret this when T is an integral 
operator. 


§6*. An Application: Sturm—Liouville Systems 


In this section, [a,b] will be a proper interval with — oo < a<b< oo. 
C[a,b] denotes the continuous functions f: [a,b] R and for n> 1, 
C”[a, b] denotes those functions in C[a, b] that have n continuous deriva- 
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tives. C{”[a, b] denotes the corresponding spaces of complex-valued func- 
tions. We want to consider the differential equation 


6.1 —~h" + gh—dAh=f, 


where A is a given complex number, g € C[a,b], and f € L’[a, b], to- 
gether with the boundary conditions 


(a) ah(a) + a,h'(a) = 0 
(b) Bh(b) + B,h’(b) = 0° 


where a, a,, 8, and 8, are real numbers and a* +a? >0, B*+ B7 > 0. 

Equation (6.1) together with the boundary conditions (6.2) is called a 
(regular) Sturm—Liouville system. Such systems arise in a number of physi- 
cal problems, including the description of the motion of a vibrating string. 
In this section we will discuss solutions of the Sturm—Liouville system by 
relating the system to a certain compact self-adjoint integral operator. 

Recall that an absolutely continuous function h on [a, b] has a derivative 
ae. and h(x) = [7h’(t) dt + h(a) for all x. 

Define 


9, = {he C%C[a, b]: h’ is absolutely continuous, 
h” & L?[a, b], and h satisfies (6.2a) }. 


9, is defined similarly but each h in Q, satisfies (6.2b) instead of (6.2a). 
The space J= 9,1 J,,. 
Define L: J —> L?{a, b] by 


6.3 Lh = —h" + gh. 


L is called a Sturm—Liouville operator. 

Note that @ is a linear space and L is a linear transformation. The 
Sturm-Liouville problem thus becomes: if A € C and f € L?[a, b], is there 
an hin 9 with(L — A)hA = f. Equivalently, for which A is f inran(L — A)? 

By placing a suitable norm on &Y, L can be made into a bounded 
operator. This does not help much. The best procedure is to consider 
(L — d)7 1. Integration is the inverse of differentiation, and it turns out that 
(L — X)~* (when we can define it) is an integral operator. 

Begin by considering the case when A = 0. (Equivalently, replace q by 
gq —.) To define L~' (even if only on its range), we need that L is 
injective. Thus we make an assumption: 


6.4 If h € 9 and Lh = 0, then h = 0. 


The first lemma is from ordinary differential equations and says that 
certain initial-value problems have nontrivial (nonzero) solutions. 


6.2 


6.5. Lemma. Jf a,a,, 8,8, € R, a*+a?>0, and B* + B? > 0, then 
there are functions h,,h, in B,,B,, respectively, such that L(h,) = 0 and 
L(h,) = 0 andh,, h, are real-valued and not identically zero. 
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The Wronskian of h, and h, is the function 
a b 
h’, hj, 
Note that W’ =h hf — huh, =h,(qh,) — (qh,)h, = 0. Hence W(x) = 
W(a) for all x. 


|- hh’, — hi hy. 


6.6. Lemma. Assuming (6.4), W(a) #0 and so h, and h, are linearly 
independent. 


Proor. If W(a) = 0, then linear algebra tells us that the column vectors in 

the matrix used to define W(a) are linearly dependent. Thus there is a A in 

R such that h,(a) = Ah,(a) and h,(a) = Ah‘,(a). Thus h, © 9 and L(h,) 

= 0. By (6.4), h, = 0, a contradiction. a 

Put c = W(a) and define g: [a, b] x [a,b] > R by 

ch (x)h,(y) ifa<x<y<b 

6.7 g(x, y) = at : : 
ch (y)h,(x) iLa<y<x<b. 


The function g is the Green function for L. 

6.8. Lemma. The function g defined in (6.7) is real-valued, continuous, and 
g(x, ¥) = g(y, x). 

PROOF. Exercise. 


6.9. Theorem. Assume (6.4). If g is the Green function for L defined in (6.7) 
and G: L*{a, b] > L?[a, b] is the integral operator defined by 


(Gf )(x) = f "a(x, ») f(y) a, 


then G is a compact self-adjoint operator, ranG = 9, LGf=f for all f in 
L?[a, b], and GLh = h for allh in Q. 


PROOF. That G is self-adjoint follows from the fact that g is real-valued and 
g(x, y) = g(y, x); G is compact by (4.7). Fix f inL*[a, b] and put h = Gf. 
It must be shown that h € J. 

Put 


H,(x) = of hy) FQ) dy and H,(x) = ot PFO) dy. 
Then 
h(x) = favo) 


= TPA (y)hg(x) f(y) dy + oP h(x) ay) dy. 
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That is, h = H,h,+h,H,. Differentiating this equation gives h’ = 
(ch f)h, + H,hi, thi, +h(-cth,f) = H,h, + h,H, a.e. Since 
H hj, + h’,H, is absolutely continuous, as part of showing that h © D we 
want to show the following. 


Claim. h’ = Hh’, + h’,H, everywhere. 


Put ¢ = Hh, + h),H, and put ¥(x) = h(a) + [3o(y) ad. So h and 
are absolutely continuous, h(a)=y(a), and h’=y’ ae. Thus h=y 
everywhere. But y has a continuous derivative @, so h does too. That is, the 
claim is proved. 

Differentiating h’ = Hh) + h“,H, gives that ae, h” =(c 'h,f)hj, + 
H hi + hv’H, + h’(—c 'h,f); since each of these summands belongs to 
L?[{a, b], h” © L?[a, 6]. 

Because H,(a)=0 and h,©€@Q,, ah(a)+ aha) = ah,(a)H,(a) + 
ah’ (a)H,(a) = [ah,(a) + a,h’(a)|H,(a) = 0. Hence h © Y,. Similarly, 
h © M,. Thus h € &. Hence ranG Cc JF. 

Now to show that LGf=/f. If h= Gf, L(h) = —-h" + qh= 
—[o"h hy f + Ah, i ho, =~ c hh, f] He q(H,h, By h,H,) = (—hi, a; 
gh,)H, + (-h% + gh,)H, + c(Ah, — hyhi)f =f since L(h,) = 
L(h,) = 0 and hth, -—h,h, =We=c. 

If h € Z, then Lh € L?[a, b]. So by the first part of the proof, LGLh = 
Lh. Thus 0 = L(GLh — h). Since ker L = (0), h = GLA and so h € ranG. 

ca 


6.10. Corollary. Assume (6.4). If he QB, XEC, and Lh=Xh, then 
Gh = d~th. Ifh © L?[a, b] and Gh = X~"h, thenh © @ and Lh = Xh. 


PROOF. This is immediate from the theorem. z 


6.11. Lemma. Assume (6.4). If a © 0,(G), then dimker(G — a) = 1. 


PROOF. Suppose there are linearly independent functions h,,h, in ker(G — 
a). By (6.10), h,, h, are solutions of the equation 


—h"” +(q-a‘)h=0. 
Since this is a second-order linear differential equation, every solution of it 
must be a linear combination of h, and h,. But h,, h, © Z so they satisfy 
(6.2). But a solution can be found to this equation satisfying any initial 


conditions at a—and thus not satisfying (6.2). This contradiction shows 
that linearly independent h,,h, in ker(G — a) cannot be found. a 


6.12. Theorem. Assume (6.4). Then there is a sequence {A,,A5,...} of real 
numbers and a basis {e,,e@5,...} for L?[a, b] such that 


(a) 0 < jA,| < |A,| < -:- and X, > o. 
(b) e, © BZ and Le, =X, ¢e,, for all n. 
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(c) If \#X,, for any X,, and f € L’{a, b], then there is a unique h in D with 
Lh — Xh =f. 

(d) If X\=A,, for some n and f € L?[a, b], then there is an h in B with 
Lh — \h = f if and only if (f,e,,) = 9. If (f,e,) = 0, any two solutions 
of Lh — Xh = f differ by a multiple of e,,. 


PROOF. Parts (a) and (b) follow by Theorem 5.1, Corollary 6.10, and Lemma 
6.11. For parts (c) and (d), first note that 


6.13 Lh — Xh = f if and only if h — AGh = Gf. 


This is, in fact, a straightforward consequence of Theorem 6.9. 

(c) If A#AX,, for any n, A~* €,(G). Since G = G*, Corollary 4.15 
implies G — A~! is bijective. So if f € L*[a, 5], there is a unique A in 
L?[{a, b] with Gf = (A7! — G)h. Thus h € @ and (6.13) implies L(A/A) — 
A(h/A) = f. 

(d) Suppose A = A, for some n. If Lh — A, h = f, then h — A,Gh = Gf. 
Hence (Gf,e,) = ¢(h,e,) —A,(Gh,e,) = (h,e,) — A,Ch, Ge, = ¢h,e,) 
—r,A_Kh,e,) = 0. So 0 = (Gf,e,) = (f,Ge,) =A,(f,e,). Hence f 1 
en: 

Since Ce, = ker(G — A>"), [e,] + = VM reduces G. Let G, = GW. So G, 
is a compact self-adjoint operator on W and A, €4,(G,). By (4.15), 
ran(G, — A,,) = 4. As in the proof of (c), if f 1 e,, there is a unique / in 
MN such that Lh — \,h = f. Note that h + ae, is also a solution. If h,, h, 
are two solutions, h, — h, © ker(L — X,,), so h, — hy = ae,,. | 


What happens if ker ZL # (0)? In this case it is possible to find a real 
number wp such that ker(L — p) = (0) (Exercise 6). Replacing q by gq — pn, 
Theorem 6.12 now applies. More information on this problem can be found 
in Exercises 2 through 5. 


EXERCISES 


1. Consider the Sturm—Liouville operator Lh = —h” with a = 0, b= 1, and for 
each of the following boundary conditions find the eigenvalues {A,,}, the 
eigenvectors {e,}, and the Green function g(x,y): (a) h(0) = A(1) = 0; (b) 
h’(0) = h’(1) = 0; (c) h(0) = Oand h’(1) = 0; (d)A(O) = A’(O) and h(1) = —A‘()). 


2. In Theorem 6.12 show that £%_,A;7 < oo (see Exercise 5.3). 


3. In Theorem 612 show that h@Q if and only if h € L*[a,b] and 
Le Mh, e, |? < oo. If h © B, show that h(x) = L7_1(h,e,,)e, (x), where 


this series converges uniformly and absolutely on [a, 5]. 


4. In Theorem 6.12(c), show that h(x) =X%_,(A, — A) ‘(f,e,)e,(x) and this 
series converges uniformly and absolutely on [a, }]. 


5. In Theorem 6.12(d), show that if fie, and Lh-—A,h=f, then h(x) = 
ey Ni vA) Xf, e,)e;(x) + ae, (x) for some a, where the series converges 
uniformly and absolutely on [a, b]. 
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6. This exercise demonstrates how to handle the case in which ker L # (0). (a) If 
h,g © C”[a, b] with h’, g’ absolutely continuous and h”, g” € L*[{a, b], show 
that 


fos ~ hg’) = [h’(b) g(b) — h(b) 8'()] —[h'(a) g(a) — h(a) 8'(a)]. 


(b) If h, g © J, show that (Lh, g) = (h, Lg). (The inner product is in L*[a, b].) 
(c) If h,g ©€D and A,wER, A #y, and if he ker(L — A), g € ker(L — p), 
then A 1 g. (d) Show that there is a real number p with ker(L — p) = (0). 


§7*. The Spectral Theorem and Functional Calculus 
for Compact Normal Operators 


We begin by characterizing the operators that commute with a diagonaliz- 
able operator. If one considers the definition of a diagonalizable operator 
(4.6), it is possible to reformulate it in a way that is more tractable for the 
present purpose and closer to the form of a compact self-adjoint operator 
given in (5.2). Unlike (4.6), it will not be assumed that the underlying 
Hilbert space is separable. 


7.1. Proposition. Let {P,;: i € I} be a family of pairwise orthogonal pro- 
Jections in BH). (That is, P,P, = PP,=0 for i#j.) If hE #, then 
Lu, {P;h: i © I} converges in # to Ph, where P is the projection of # onto 
V{PH#:i1€T}. 


This appeared as Exercise 3.5 and its proof is left to the reader. 

If {P,; i © IT} is as in the preceding proposition and 4, = P,#, then 
with the notation of Definition 3.4, P is the projection of # onto ®,.%,. 
Write P = ,P.. A word of caution here: Ph = £,P,h, where the conver- 
gence is in the norm of #. However, 1,P. does not converge to P in the 
norm of 4(#). In fact, it never does unless J is finite (Exercise 1). 


7.2. Definition. A partition of the identity on # is a family { P.: i € I} of 
pairwise orthogonal projections on # such that V,P,#?= #. This might be 
indicated by 1 = 1, P, or 1 = ©®,P,. [Note that 1 is often used to denote the 
operator on # defined by 1(h) =/h for all h. Similarly if a € F, a is the 
operator defined by a(h) = ah for all h.] 


7.3. Definition. An operator A on # is diagonalizable if there is a 
partition of the identity on #7, {P,: i€ J}, and a family of scalars {a;: 
i © J} such that sup,|a,| < oo and Ah = a,h whenever h € ran P.. 


It is easy to see that this is equivalent to the definition given in (4.6) when 
H is separable (Exercise 2). Also, ||A|| = sup,|a,|. 
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To denote a diagonalizable operator satisfying the conditions of (7.3), 
write 


A=Y'a,P, or A= 0,a,P,. 
1 
Note that it was not assumed that the scalars a, in (7.3) are distinct. 


There is no loss in generality in assuming this, however. In fact, if a; = a 
then we can replace P; and P, with P; + P.. 


sie 


7.4. Proposition. An operator A on # is diagonalizable if and only if there 
is an orthonormal basis for # consisting of eigenvectors for A. 


PROOF. Exercise. 


Also note that if A = @,a,P,, then A* = @,a,;P, and A is normal 
(Exercise 5). 


7.5. Theorem. Jf A = ©,a,P, is diagonalizable and all the a, are distinct, 
then an operator B in @(#) satisfies AB = BA if and only if for each i, 
ran P, reduces B. 


Proor. If all the a, are distinct, then ran P, = ker(A — a,). If AB = BA 
and Ah=a,h, then ABh = BAh = B(a,;h) = a,Bh; hence Bh € ran P, 
whenever © ran P.. Thus ran P, is left invariant by B. Therefore B leaves 
V{ranP: j #i} =A, invariant. But since @,P,= 1, ¥W, = (ran P,)~. 
Thus ran P, reduces B. 

Now assume that B is reduced by each ran P,. Thus BP. = P.B for all i. 
If h € #, then Ah = Y,a,P.h. Hence BAh = V,a,BPh = L,a,PBh = ABh. 


or ie | 


(Why is the first equality valid?) = 


Using the notation of the preceding theorem, if AB = BA, let B, = 
B\|ran P,. Then it is appropriate to write B= ©,B, on #= ©,(P,#). One 
might paraphrase Theorem 7.5 by saying that B commutes with a diagonal- 
izable operator if and only if B can be “diagonalized with operator entries.” 


7.6. Spectral Theorem for Compact Normal Operators. /f T is a compact 
normal operator on the complex Hilbert space #, {X,,A5,...\ are the 
distinct nonzero eigenvalues of T, and P, is the projection of # onto 
ker(T — A,,), then PP, = P,P, = 9 ifn # mand 


7.7 PSD ha. 
n=1 


where this series converges to T in the metric defined by the norm on B(H#). 


Proor. Let A = (T+ T*)/2, B=(T— T*)/2i. So A,B are compact 
self-adjoint operators, T = A + iB, and AB = BA since T is normal. The 


II.7. The Spectral Theorem and Functional Calculus for Compact Normal Operators 57 


idea of the proof is rather simple. We'll get started in this proof together but 
the reader will have to complete the details. 

By Theorem 5.1, A = La, E,, where a, € R, a, # a,, if n # m, and E, 
is the projection of # onto ker(A — a): Since AB = "BA, the idea is to 
use Theorem 7.5 and Theorem 5.1 applied to B to diagonalize A and B 
simultaneously; that is, to find an orthonormal basis for # consisting of 
vectors that are simultaneously eigenvectors of A and B. 

Since BA = AB, E, 4#= L, reduces B for every n (7.5). Let B, = B|Y 
then B, = B* and dim Y, < oo. Applying (5.1) to B, (or, rather, the 
corresponding theorem from linear algebra) there is a basis {e(”: 1 <j < 
d,,} for ZY, and real numbers { B{”’: 1 <j < d,,} such that Be”) = BMe™. 
Thus Te") = = Ae\” + iBe;”) = = eC + iB” ye”. 

Therefore {e{”: 1 <j <d,, n = 1} is a basis for cl(ran A) consisting of 
eigenvectors for T. It may be "that cl(ran A) # cl(ran7’). Since B is reduced 
by ker A = (ran A)~ and B, = B\ker A is a compact self-adjoint operator, 
there is an orthonormal bacis {e: j = 1} for cl(ran By) and scalars { B: 
j = 1} such that Bef? = Be. It follows that Te = iBe. Moreover, 
ker 7 C kerA ker B, so cl(ran 7) C cl(ran A) @ cl(ran B)). 

The remainder of the proof now consists in a certain amount of book- 
keeping to gather together the eigenvectors belonging to the same eigenval- 
ues of T and the performing of some light housekeeping chores to obtain 
the convergence of the series (7.7) i 


7.8. Corollary. With the notation of (7.6): 


(a) kerT = [V{ P37: n>=1}]-; 
(b) each P,, has finite rank; 
(c) ||7]| = sup{|A,|: 2 = 1} and X, ~ 0 asn- oo. 


The proof of (7.8) is similar to the proof of (5.3). 


7.9. Corollary. If T is a compact operator, then T is normal if and only if T 
is diagonalizable. 


If T is a normal operator which is not necessarily compact, there is a 
spectral theorem for T which has a somewhat different form. This theorem 
states that JT can be represented as an integral with respect to a measure 
whose values are not numbers but projections on a Hilbert space. Theorem 
7.6 will be a consequence of this more general theorem and correspond to 
the case in which this projection valued measure is “atomic.” 

The approach to this more general spectral theorem will be to develop a 
functional calculus for normal operators T. That is, an operator $(T) will 
be defined for every bounded Borel function ¢ on C and certain properties 
of the map ¢ > ¢(T) will be deduced. The projection valued measure will 
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then be obtained by letting »(A) = x,(7). These matters are taken up in 
Chapter IX. 

At this point, Theorem 7.6 will be used to develop a functional calculus 
for compact normal operators. For the remainder of this section #7 1s a 
complex Hilbert space. 


7.10. Definition. Denote by /“(C) all the bounded functions ¢: C — C. If 
T is a compact normal operator satisfying (7.7), define $(T): > # by 


¢(T) = y $(A,,)P, + (0) Po, 


n=] 


where P, = the projection of # onto ker T. 


Note that (7) is a diagonalizable operator and ||@(7’)|| = 
sup{|o(0)|, |6(A,)|,--. } (4.6). Much more can be said. 


7.11. Functional Calculus for Compact Normal Operators. /f T is a compact 
normal operator on a C-Hilbert space #, then the map ¢+> $(T) of 
1*(C) — B(#) has the following properties: 


(a) 6 > $(T) is a multiplicative linear see of 1~(C) into B(H#). If d = 1, 
o(T) = 1; if o(z) =z, then o(T) = 

(b) []o(T Il = sup{|d(A)|: A € 9,(T)}. 

(c) o(T)* = $*(T), where $* is the function defined by @*(z) = (2). 

(d) If A € B(#) and AT = TA, then Ad(T) = 6(T)A for all @ in 1@(C). 


ProoFr. Adopt the notation of Theorem 7.6 and (7.10). 

(a) If ¢,~e/~™(C), then (d¥)(z)= $(z)¥(z) for z in C. Also, 
$(T)¥(T )h = [(0)Py + LO(A,,) Pl WO)Po + LY(A,,) Pal = [(0) Py + 
L(A, ) Pv O)P A+ 2, WOA,,) P22). Since P,P, = 0 when n # m, this 
gives that $(T)¥(T)h = $(0) (0) Poh + L,0(A, VA, ) Py = (OW TA. 
Thus ¢ — $(T) is multiplicative. The linearity of the map is left to the 
reader. If o(z)=1, then $(T) = 1(7) = Py) + ©, P, = 1 since 
{ Po, P,,...} 18 a partition of the identity. If @(z) = z, o(A,,) =A,, and so 
o(T) = Tf. 

Parts (b) and (c) follow from Exercise 5. 

(d) If AT = TA, Theorem 7.5 implies that P)#’, P}#?,... all reduce A. 
Fix h, in P,4#,n>0. If ¢ € 1°(C), then Ah, € P,# and so $(T)Ah,, = 
6(A,,)Ah, = A(O(A, )A,,) = AG(T )h,. If h € #, then h = Ue_oh,, where 
h,&P,. Hence $(T)Ah = L°_ .6(T)Ah, = Ee A(T)h. = = Ad(T )h. 
(Justify the first equality.) i 


Which operators on # can be expressed as $(7) for some ¢ in /“(C)? 
Part (d) of the preceding theorem provides the answer. 
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7.12. Theorem. Jf T is a compact normal operator on a C-Hilbert space, 
then {o(T): ¢ © 1°(C)} is equal to 


{B & B(#): BA = AB whenever AT = TA}. 


Proor. Half of the desired equality is obtained from (7.11d). So let 
B & &(#) and assume that BA = AB whenever AT = TA. Thus, B must 
commute with T itself. By (7.5), B is reduced by each P, H= #, n = 0; 
put B, = BL#,. Fix n> 0 for the moment and let A, be any bounded 
operator in 4(.#,). Define Ah=A,h if he He and Ah=O0if he #, 
m # n, and extend A to # by linearity; soA = ®°_,A,, where A,, = O if 
m#n. By (7.5), AT = TA; hence BA = AB. This implies that B.A, = 
A,,B,. Since A,, was arbitrarily chosen from 4(%,), B, = B, for some B, 
(Exercise 7). If ¢: C >C is defined by $(0) = 6, and $(A,)= 8B, for 
n> 1, then B= 9(T). a 


7.13. Definition. If A © @(#), then A 1s positive if (Ah, h) = 0 for all h 
in #. In symbols this is denoted by A > 0. 


Note that by Proposition 2.12 every positive operator is self-adjoint. 


7.14. Proposition. If T is a compact normal operator, then T is positive if 
and only if all its eigenvalues are positive real numbers. 


Proor. Let T= 2PA,P,. If T>0 and hE P,# with |[hl|| = 1, then 
Th = ,,h. Hence X,, = (Th,h) = 0. Conversely, assume each A, > 0. If 
hEH#, h=hy+ U_jh,, where hy © kerT and h, © P,4 for n> 1. 
Then Th = LYA,h,. Hence (Th, h) = (L7_\\,h,, ho + U2_ih,,) = 
ratem=0N KAgs Am) = L2_,A,||2,|? = O since (h,,h,,) = O when n # m. 
Si 


7.15. Theorem. Jf T is a compact self-adjoint operator, then there are unique 
positive compact operators A, B such that T = A — B and AB = BA = 0. 


PROOF. Let T = L_,A,P,, as in (7.6). Define ¢,~: C > C by $(A,) =A, 
if A, > 0, o(z) = 0 otherwise; Y(A,,) = —A,, if A, < 0, W(z) = 0 other- 
wise. Put A = $(T) and B=y(T). Then A=L{A,P: A, > 0} and 
B= {-A,P,: A, < 0}. Thus T= A — B. Since ¢) = 0, AB = BA = 0 
by (7.1la). Since $,~ >0, A,B>O by the preceding proposition. It 
remains to show that A, B are unique. 

Suppose T = C — D where C,D are compact positive operators and 
CD = DC = 0. It is easy to check that C and D commute with T. Put 
A, =0 and P) = the projection of # onto kerT. Thus C and D are 
reduced by P,#= #, for all n> 0. Let C,= C|H%, and D, = D\%. So 
C,D, = DC, = 9, A, P, = T\%, = C, — D,, and C,, D, are positive. Sup- 
pose A, >0 and let hE #. Since C,D,=0, kerC, D cl{ran D,] = 
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(kerD,)+. So if h © (kerD,)*, then A,h = —D,h. Hence A, ]||h\|* = 
-—(Dh,h) <0. Thus h=0 since A, > 0. That is, kerD, = 4. Thus 

,=0= BIH and C,=A,P,=A|#%. Similarly, if A, <0, C,=0= 
A|\#, and D, = —A,P, = B\#H,. On H, T\% = 0 = Cy — Do. Thus C, 
= Dy. But 0 = C)D, = CZ. Thus 0 = (Coh,h) = ||Cohl|*, so Co = 0 = 
A|H# and Dy = 0 = B|%. Therefore C = A and D = B. a 


Positive operators are analogous to positive numbers. With this in mind, 
the next result seems reasonable. 


7.16. Theorem. Jf T is a positive compact operator, then there is a unique 
positive compact operator A such that A? = T. 


Proor. Let T = L°°_,A,,P,, as in the Spectral Theorem. Since T > 0, A,, > 0 
for all n (7.14). Let o(A,) =N/* and $(z) = 0 otherwise; put A = $(T). 
It is easy to check that A > 0; A = XXANY?P, so that A is compact; and 
Av = T, 

The proof of uniqueness is left to the reader. = 


EXERCISES 


1. If { P,} is a sequence of pairwise orthogonal nonzero projections and P = LP,, 
show that ||P — 24_, P| = 1 for all n. 


2. If # is separable, show that the definitions of a diagonalizable operator in (4.6) 
and (7.3) are equivalent. 


3. If A = La, P. as in (7.3), show that A is compact if and only if: (a) a, = 0 for 


all but a countable number of i; (b) P, has finite rank whenever a, # 0; (c) if 
{a,,@5,...} = {a,;: a, # 0}, then a, > 0 as n- oo. 


4. Prove Proposition 7.4. 


5. If A = ®.a,P,, show that A* = ©,4,P,, A is normal, and ||A|| = sup{|a,|: 
ie T}. 


6. Give the remaining details in the proof of (7.6). 


7. If A € @(#) and AT = TA for every compact operator 7, show that A is a 
multiple of the identity operator. 


8. Suppose T is a compact normal operator on a C-Hilbert space such that 
dim ker(T — A) < 1 for all A in C. Show that if A € @(#) and AT = TA, 
then A = $(T) for some ¢ in /“(C). 


9. Prove a converse to Exercise 8: if 7 is a compact normal operator such that 
{A € B(H#): AT = TA} = {6(T): ¢ € 1~(C©)}, then dimker(7 — A) < 1 for 

all A in C. 
10. Let T be a compact normal operator and show that ker(T — A) < 1 for all A in 


C if and only if there is a vector h in # such that { p(T)h: p is a polynomial 
in one variable} is dense in #. (Such a vector h is called a cyclic vector for T.) 
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11. If AX & C, let 6, be the unit point mass at A; that is, 6, is the measure on C 
such that 6,(4)=1 if AGA and 6,(A)=0 if AEA. If {A,,A,,...} are 
distinct complex numbers and {«a,,} is a sequence of real numbers with a, > 0 
and La, < oo, let p = L_,a,6, ; so p is a finite measure. If ¢ € /°(C), let 
M, be the multiplication operator on L*(). Define T: L*(p) > L*(p) by 
(Tf )(A,) =A, f(A,,). Prove: (a) T is a normal operator; (b) 7 has a cyclic 
vector (see Exercise 10); (c) if A © @(#) and AT = TA, then A = M, for 
some ¢ in /©(C); (d) T is compact if and only if A, — 0. (e) Find all of the 
cyclic vectors for 7. (f) If T is compact, find the decomposition (7.7) for T. 


12. Using the notation of Theorem 7.11, give necessary and sufficient conditions on 
T and ¢ that $(7) be compact. (Hint: consider separately the cases where ker T 
is finite or infinite dimensional.) 


13. Prove the uniqueness part of Theorem 7.16. 


14. If T € @(#), show that T*T > 0. 


15. Let T be a compact normal operator and show that there is a compact positive 
operator A and a unitary operator U such that T = UA = AU. Discuss the 
uniqueness of A and U. 


16. (Polar decomposition of compact operators.) Let T € &)(#) and let A be the 
unique positive square root of T*T [(7.16) and Exercise 14]. (a) Show that 
||AA|| = || 7h|| for all h in #. (b) Show that there is a unique operator U such 
that || UA|| = ||h|| when h 1 ker 7, Uh = 0 when h € kerT, and UA = T. (c) If 
U and A are as in (a) and (b), show that 7 = AU if and only if T is normal. 


17. Prove the following uniqueness statement for the functional calculus (7.11). If T 
is a compact normal operator on a C-Hilbert space # and 1: I[*(C) > @(H#) 
is a multiplicative linear map such that ||t()|| = sup{|@(A)||: A © 6,(7)}, 
T(/) = 1, and 7(z) = T, then r(¢) = $(T) for every $ in /*(C). 


§8*. Unitary Equivalence for Compact 
Normal Operators 


In Section I.5 the concept of an isomorphism between Hilbert spaces was 
defined as the natural equivalence relation on Hilbert spaces. This equiv- 
alence relation between the spaces induces a natural equivalence relation 
between the operators on the spaces. 


8.1. Definition. If A, B are bounded operators on Hilbert spaces #, #, 
then A and B are unitarily equivalent if there is an isomorphism U: #-> 
such that UAU~' = B. In symbols this is denoted by A = B. 


Some of the elementary properties of unitary equivalence are contained in 
Exercises 1 and 2. Note that if UAU~! = B, then UA = BU. 
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The purpose of this section is to give necessary and sufficient conditions 
that two compact normal operators are unitarily equivalent. Later, in 
Section IX.10, necessary and sufficient conditions that any two normal 
operators be unitarily equivalent are given and the results of this section are 
subsumed by those of that section. 


8.2. Definition. If T is a compact operator, the multiplicity function for T 
is the function m;: C > C U {co} defined by m,(A) = dimker(T — A). 


Hence m,(A) > 0 for all A and m;(A)> 0 if and only if A is an 
eigenvalue for T. Note that by Proposition 4.13, m;(A) < 00 if A # 0. 

If 7, S are compact operators on Hilbert spaces and U: #-> &# 1s an 
isomorphism with UTU~' = S, then U ker(T — A) = ker(S — A) for every 
A in C. In fact, if Th =Ah, then SUh = UTh = Uh and so UheE 
ker(S — A). Conversely, if k € ker(S—A) and h=U~'k, then Th = 
TU~'k = U~'Sk = Xh. In particular, it must be that m; = m.. If S and T 
are normal, this condition is also sufficient for unitary equivalence. 


8.3. Theorem. Two compact normal operators are unitarily equivalent if and 
only if they have the same multiplicity function. 


PRooF. Let 7, S be compact normal operators on Hilbert spaces #, %. If 
T = S, then it has already been shown that m,= m,. Suppose now that 
m,=m,. We must manufacture a unitary operator U: #-— # such that 
UTU '=S. 

Let T= U%_,A,P, and let S=L°_u,Q,, as in the Spectral Theorem 
(7.6). Soif in # m, then A, #A,, and pw, # p,,, and each of the projections 
P, and Q, has finite rank. Let Pj,Q,) be the projections of #, # onto 
ker T, kerS; so Py) = (LPP,)~ and QO, = (LPO,,)~ . Put Ay = by = O. 

Since m; = ms, 0 < m7(A,) = ms(A,,). Hence there is a unique p; such 
that wp, =A, Define 7: N > N by letting y,,, =A, Let 7(0) = 0. Note 
that 7 is one-to-one. Also, since 0 < m,(n,) = mr(H,,), for every n there is 
a j such that 7(j)=n. Thus 7: N U {0} > N U {0} is a bijection or 
permutation. Since dim P, = m7(A,,) = Ms(Han)) = dim Q,,,,, there is an 
isomorphism U,: P,#-> Q,,,,% for n = 0. Define U: #’°—> # by letting 
U = U,on P,# and extending by linearity. Hence U = ©°~_,U,,. It is easy 
to check that U is an isomorphism. Also, if h © P,#, n > 0, then UTh = 
Uh = ta (,,Uh = SUh. Hence UTU-'= S. 


If V is the Volterra operator, then m, = 0 (4.11) and V and the zero 
operator are definitely not unitarily equivalent, so the preceding theorem 
only applies to compact normal operators. There are no known necessary 
and sufficient conditions for two arbitrary compact operators to be unitarily 
equivalent. In fact, there are no known necessray and sufficient conditions 
that two arbitrary operators on a finite-dimensional space be unitarily 
equivalent. 
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EXERCISES 


1. 
Zz 


10. 


ll. 


12, 


Show that “unitary equivalence” is an equivalence relation on @(#%). 


Let U: #- X be an isomorphism and define p: B(#) — &(&) by p(A) = 
UAU'. Prove: (a) ||p(A)|| = |[A|], 0(A*) = p(A)*, and p is an isomorphism 
between the two algebras #(#) and @(#). (b) p(A) € Bo (#) if and only if 
A € B(#). (c) If T € B(#), then AT = TA if and only if p(T)p(A) = 
p(A)p(T). (d) If A € B(#) and “4 < #, then @ is invariant (reducing) for 
A if and only if U-@ is invariant (reducing) for p(A). 


. Say that an operator A on # is irreducible if the only reducing subspaces for A 


are (0) and #. Prove: (a) The Volterra operator is irreducible. (b) The unilateral 
shift is irreducible. 


. Suppose A = @{A4;:i € 1} and B= @{B;: i © J} where each A, and B, is 


irreducible (Exercise 3). Show that A = B if and only if there is a biyection 
a: I > I such that A; = B,,,). 


. If T is a compact normal operator and m,; = m is its multiplicity function, 


prove: (a) {A: m(A) > 0} is countable and 0 is its only possible cluster point; 
(b) m(A) < oo if A #0. Show that if m: C > N U {0,00} is any function 
satisfying (a) and (b), then there is a compact normal operator T such that 
my = mM. 


. Show that two projections P and Q are unitarily equivalent if and only if 


dim(ran P) = dim(ran Q) and dim(ker P) = dim(ker Q). 


. Let A: L7(0,1) > L*(0,1) be defined by (Af)(x) = xf(x) for f in L7(0,1) and 


x in (0,1). Show that A = A’. 


. Say that a compact normal operator T is simple if m; < 1. (See Exercises 7.10 


and 7.11.) Show that every compact eae operator T on a separable Hilbert 
space is unitarily equivalent to ©” ,7,, where each 7, is a simple compact 
normal operator and m; > mr, | for all n. Show that ||7,|| > 0. (Of course, 
there may only be a finite number of T,,.) 


. Using the notation of Exercise 8, suppose also that S is a compact normal 


operator and S = ®™_|S,, where S, is a simple compact normal operator and 
ms >msgs _ for all n. Show that T = S if and only if T, = S, for all n. 


If 7 is a compact normal operator on a separable Hilbert space, show that there 
are simple compact normal operators 7,,7),... such that T= 0 T, @ T{? ® 
T® ® ---, where: (a) for any operator A, A") = A ® --- @A (n times); (b) 0 
is the zero ee he on an infinite-dimensional space; (© te forn#k, m;m,, = 0; 
and (d) if ker 7 is infinite dimensional, then ker T, = (0) for all n. (Of course 
not all of the summands need be present.) Show that ||7,|| — 0. 


Using the notation of Exercise 10, let S be a compact normal operator and let 
0@ S, ® SY @® --- be the corresponding decomposition. Show that T= S if 
and only if 7, = S, and kerT and kerS have the same dimension. 


If T is a compact normal operator, show that T and T © T are not unitarily 
equivalent. 
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13. Give an example of a nontrivial operator T such that T = T ® T. Show that if 
T=T@T, then T=7T@®T7@® ---. Characterize the diagonalizable normal 
operators T such that T = T © T. 


14. Let # be the space defined in Example I.1.8 and let U: #-— L?(0,1) be the 


isomorphism defined by Uf = f’ (Exercise 1.1.4). If (Af)(x) = xf(x) for f in 
HH, what is UAU™'? 


CHAPTER III 


Banach Spaces 


The concept of a Banach space is a generalization of Hilbert space. A 
Banach space assumes that there is a norm on the space relative to which 
the space is complete, but it is not assumed that the norm is defined in terms 
of an inner product. There are many examples of Banach spaces that are not 
Hilbert spaces, so that the generalization is quite useful. 


§1. Elementary Properties and Examples 


1.1. Definition. If 2 is a vector space over F, a seminorm is a function 
p: #-— [0, co) having the properties: 


(a) p(x + y) < p(x) + p(y) for all x, y in &. 
(b) p(ax) = |al|p(x) for all a in F and x in @. 


It follows from (b) that p(0) = 0. A norm is a seminorm p such that 
(c) x = Oif p(x) =0. 


Usually a norm is denoted by || - ||. 

The norm on a Hilbert space is a norm. Also, the norm on 4(#) is a 
norm. 

If @ has a norm, then d(x, y) = ||x — y|| defines a metric on Z. 


1.2. Definition. A normed space is a pair (2, || - ||), where 2 is a vector 
space and || - || is anorm on 2%. A Banach space is a normed space that is 
complete with respect to the metric defined by the norm. 
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1.3. Proposition. Jf Z is a normed space, then 


(a) the function & X 4-— & defined by (x, y)'> x + y is continuous; 
(b) the function F X #— & defined by (a, x) > ax is continuous. 


ProoF. If x, — x and y, > y, then ||(x, + y,) — (x + y)|] = I(x, — x) + 
(y, ~ Yl < Ix, — xll + lly, — yl] 2 0 as n> oo. This proves (a). The 
proof of (b) is left to the reader. i 


The next lemma is quite useful. 


1.4. Lemma. If p and gq are seminorms on a vector space 2, then the 
following statements are equivalent. 


(a) p(x) < q(x) forall x. (That is, p < q.) 
(b) {x © 2: q(x) <)}C {x © Z: p(x) < I}. 
(b’) p(x) < 1 whenever q(x) < 1. 
(c) {x: g(x) <1} (x: p(x) < 1). 
(c’) p(x) < 1 whenever g(x) < 1. 
(d) {x: g(x) <1} {x: p(x) < I}. 
(d’) p(x) < 1 whenever q(x) < 1. 
PROOF. It is clear that (b) and (b’), (c) and (c’), and (d) and (d’) are 
equivalent. It is also clear that (a) implies all of the remaining conditions 
and that (b) implies (d). It will be shown that (d) implies (a). The proof that 
(c) implies (a) is left as an exercise. 

Assume that (d) holds and put g(x) = a. If e > 0, then g((a + €) 'x) = 
(a + ©) la < 1. By (d), 1 > p((a t+ &) 7 'x) = (ate) 'p(x), so p(x) <a 
+ e¢= q(x) + «. Letting e — 0 shows (a). = 


If || - ||, and || - |], are two norms on 2%, they are said to be equivalent 
norms if they define the same topology on 2%. 


1.5. Proposition. Jf || - ||, and || - ||, are two norms on 2, then these norms 

are equivalent if and only if there are positive constants c and C such that 
e|[x|]1 S [lxIlo S Cll 

for all x in &, 


PROOF. Suppose there are constants c and C such that c||x||, < ||x||, < 
C||x||, for all x in @. Fix x, in 2, e > 0. Then 


{x € #: ||x — xolly < e/C} C {x EB: ||x — xoll2 < e}, 
(xe 2 eH ab ee eee 2 sexi Se). 


This shows that the two topologies are the same. Now assume that the two 
norms are equivalent. Hence { x: ||x||,; < 1} is an open neighborhood of 0 in 
the topology defined by || - ||,. Therefore there is an r > 0 such that {x: 
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Ixllo <r} S {x |lxlly < 1}. If g(x) = “|x|, and p(x) = ||x\|,, the pre- 
ceding lemma implies ||x||, <r *||x|], or cl|x||, < ||x||,, where c = r. The 
other inequality is left to the reader. | 


There are two types of properties of a Banach space: those that are 
topological and those that are metric. The metric properties depend on the 
precise norm; the topological ones depend only on the equivalence class of 
norms (see Exercise 4). 


1.6. Example. Let X be any Hausdorff space (all spaces in this book are 
assumed to be Hausdorff unless the contrary is specified) and let C,( X) = all 
continuous functions f: X — F such that ||f]| = sup{|f(x)|: x © X} < oo. 
For f,g in C,(X), define (f+ g): X > F by (f + g)(x) = f(x) + g(x); 
for a in F define (af )(x) = af(x). Then C,(X) is a Banach space. 


The proofs of the statements in (1.6) are all routine except, perhaps, for 
the fact that C,(X) is complete. To see this, let { f,} be a Cauchy sequence 
in C,(X). So if e > 0, there is an integer N. such that for n,m> N., 
e> |lf, - fall = suptli.(x) -—f,,(%)|: x © X}. In particular, for any x in 
XU) — fn 1 S Why — fyll < € when n,m > N,. So { f,(x)} is a Cauchy 
sequence in F. Let f(x) = lim f(x) if x © X. Now fix x in X. If n,m>QN_, 
then [f(x) — fx()] < (0) = fn OL + Uh — Lull < [fC — fa] + €. 
Letting m — oo gives that |f(x) — f,(x)| < e when n => N.. This is indepen- 
dent of x. Hence ||f— f,|| < ¢ for n = N.. 

What has been just shown is that ||f — f,|| ~ 0 as n > oo. Note that this 
implies that f(x) — f(x) uniformly on X. It is standard that f is continu- 
ous. Also, ||f|| < llf—- All + IIAl| < co. Hence f © C,(X) and so C,(X) is 
complete. 

Note that a linear subspace Y of a Banach space % that is topologically 
closed is also a Banach space if it has the norm of %. 


1.7. Proposition. If X is a locally compact space and C,( X ) = all continuous 
functions f: X — F such that for all e > 0, {x © X: |f(x)| = €} is compact, 
then C)(X) is a closed subspace of C,( X) and hence is a Banach space. 


PROOF. That C)(X) is a linear manifold in C,(X) is left as an exercise. It 
will only be shown that C)(X) is closed in C,(X). Let {f,} C Cy(X) and 
suppose f, > f in C,(X). If ¢ > 0, there is an integer N such that 
lf, —SI| < €/2; that is, |f,(x) — f(x)| < ¢/2 for all n > N and x in X. If 
lf(x)| = e, then e < |f(x) -—f,(x) + f,(x)| < €/2 + |f,(x)| for n => N; so 
lf,(x)| = e/2 for n > N. Thus, {x © X: |f(x)| =e} C {x © X: |fy(x)| = 
e/2\ so that fe Co(X). = 


The space C,(X) 1s the set of continuous functions on X that vanish at 
infinity. If X = R, then C,(R) = all of the continuous functions f: R — F 
such that lim, _, , ,, f(x) = 0. If X is compact, Co(X) = C,(X) = CCX). 
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If J is any set, then give J the discrete topology. Hence J becomes locally 
compact. Also any function on J is continuous. Rather than C,(/), the 
customary notation is /~*(/). That is, /°(7) = all bounded functions f: 
I> F with ||/fl| = sup{|f(@)|: ie 7}. CoV) consists of all functions f: 
I — F such that for every e > 0, {i € J: |f(i)| = e} 1s finite. If 7 = N, the 
usual notation for these spaces is /* and cy. Note that /* consists of all 
bounded sequences of scalars and c, consists of all sequences that converge 
to 0. 


1.8. Example. If (X,92,) is a measure space and 1 < p < oo, then 
L?’(X, Q, w) is a Banach space. 


The preceding example is usually proved in courses on integration and no 
proof is given here. 


1.9. Example. Let J be a set and 1 < p < oo. Define /”(/) to be the set of 
all functions f: J > F such that Y{|f(i)/?: i © 1} < 00; and define |||, = 
(L{{f@) PP: ie 1})'/?. Then /?(J) is a Banach space. If J =N, then 
1?(N) = LT’. 


If 2 = all subsets of J and for each A in (2, »(4) = the number of points 
in A if A is finite and u(A) = oo otherwise, then /?(J) = L?(J, 92, p). So 
the statement in (1.9) is a consequence of the one in (1.8). 


1.10. Example. Let n > 1 and let C‘” [0,1] = the collection of functions 
f: [0,1] > F such that f has n continuous derivatives. Define ||f]| = 
SUPy <, <n (SuP{|f (x): 0 < x < 1}}. Then C“ (0, 1] is a Banach space. 


1.11. Example. Let 1 < p < 00 and n > 1 and let W,"[0, 1] = the functions 
f: [0,1] > F such that f has n—1 continuous derivatives, f(~" is 
absolutely continuous, and f‘”) © L?[0,1]. For f in W,”[0, 1], define 
if 1/p 
inh = | Pe(xyr ax] 
k=0L°9 
Then W,"{0, 1] is a Banach space. 


The following is a useful fact about seminorms. 


1.12. Proposition. If p is a seminorm on %, |p(x) — p(y)| < p(x — y) for 
all x, y in &. If || - || is a norm, then | ||x|| — |lyll | < |lx — yll for all x, y 
in &. 

PROOF. Of course, the inequality for norms is a consequence of the one for 
seminorms. Note that if x,y & 2%, p(x)=p(x-—yty)<p(x-y)+t 


p(y), so p(x)— p(y) < p(x — y). Similarly, p(y) — p(x) s p(x — y). 
|| 
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There is the concept of “isomorphism” for the category of Banach spaces. 


1.13. Definition. If 2 and Y are normed spaces, 2 and ¥Y are isometri- 
cally isomorphic if there is a surjective linear isometry from % onto &. 


The term isomorphism in Banach space theory is reserved for linear 


biyections T: 2— ¥Y that are homeomorphisms. 


EXERCISES 


1. 
2, 


13. 


. Let c = the set of all sequences {a,}, a 


Complete the proof of Proposition 1.3. 


Complete the proof of Proposition 1.5. 


. For 1<p<oo and x =(x,...,x7) in F%, define |)x\|, = 2424|x,/? 1’; 


define ||x||,. = sup{|x,|: 1 <j < d}. Show that all of these norms are equiv- 
alent. For 1 <p, q < o, what are the best constants c and C such that 
c\lxll, < lxll, < Cllxl|, for all x in F4? 


. If 1 <p < o and ||- ||, is defined on R? as in Exercise 3, graph {x € R°: 


|x||, = 1}. Note that if 1 < p < 00, ||x||, = |lyll, = 1, and x #y, then for 
O<7< 1, |\tx + (1 — t)yll, < 1. The same cannot be said for p = 1, 00. 


, nF, such that lima, exists. Show 
that c is a closed subspace of /* and hence is a Banach space. 


. Let X = {n'': n> 1} U {0}. Show that C(X) and the space of c of Exercise 5 


are isometrically isomorphic. 


(a) Show that if 1 < p < oo and J is an infinite set, then /?(/) has a dense set 
of the same cardinality as J. 

(b) Show that if 1 < p < 00, /?(J) and /?(J) are isometrically isomorphic if 
and only if J and J have the same cardinality. 


. If /°(7) and /°(J) are isometrically isomorphic, do J and J have the same 


cardinality? 


. Show that /* is not separable. 

. Complete the proof of Proposition 1.7. 
. Verify the statements in Example 1.10. 
. Verify the statements in Example 1.11. 


. Let X be locally compact and let X,, = X U {co} be the one-point compactifi- 


cation of X. Show that C)(X) and { f © C(X,,): f(co) = 0}, with the norm it 
inherits as a subspace of C(X,,), are isometrically isomorphic Banach spaces. 


Let X be locally compact and define C.( X) to be the continuous functions f: 
X — F such that spt f= cl{x © X: f(x) # 0} is compact (spt f is the support 
of f). Show that C.(X) is dense in C)(X). 


70 III. Banach Spaces 


14. If X is a metrizable locally compact space that is o-compact, then C,)(X) is 
separable. (X is o-compact if X = U%_,K,, where each K,, is compact.) 


15. If W,"[0,1] is defined as in Example 1.11 and fe W,"[0,1], let ||/fIll = 
[fF Cx)P dx}? + [ff (x)P dx]7?. Show that ||] - ||| is equivalent to the 
norm defined on W,"[0, 1]. 


16. Let 2 bea normed space and let % be its completion as a metric space. Show 
that 2 is a Banach space. 


§2. Linear Operators on Normed Spaces 


This section gathers together a few pertinent facts and examples concerning 
linear operators on normed spaces. A fuller study of operators on Banach 
spaces will be pursued later. 

The proof of the first result is similar to that of Proposition I.3.1 and is 
left to the reader [Also see (II.1.1)]. @(%,¥Y%) = all continuous linear 
transformations A: > &. 


2.1. Proposition. Jf and Y are normed spaces and A: X—> Y is a linear 
transformation, the following statements are equivalent. 

(a) AGC BF,Y). 

(b) A is continuous at 0. 

(c) A is continuous at some point. 


(d) There is a positive constant c such that ||Ax|| < c||x|| for all x in &. 
IfA = B(L,Y) and 


|A|| = sup{||Ax||: |]x]] < 1}, 
then 
|All 


sup {||Ax||: |Ixl] = 1} 
sup {||Ax||/llxl]: x # 0} 
= inf{¢ > 0: ||Ax|] < ¢|lx|| for xin £}. 


||A|| is called the norm of A and 4(#,Y%) becomes a normed space if 
addition and scalar multiplication are defined pointwise. @(%, Y) is Banach 
space if Y is a Banach space (Exercise 1). A continuous linear operator is 
also called a bounded linear operator. 

The following examples are reminiscent of those that were given in 
Section II.1. 


2.2. Example. If (X, 2, ») is a o-finite measure space and ¢ © L”(X,Q, p), 
define M,: L?(X,Q,") > L?(X,2, py), 1 <p < «0, by Mj f= of for all f 
in L?(X,%,p). Then M, © B(L?(X, 2, w)) and ||Mgl| = |l4ll..- 
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2.3. Example. If (X,02,u), k, c,, and c, are as in Example IJ.1.6 and 
1<p< _o, then K: L?(p) > L?(p), defined by 


(Kf )(x) = [k(x ») f(y) daly) 
for all f in L?(p) and x in X, is a bounded operator on L?(p) and 
KI] < c}/%c/?, where 1/p + 1/q = 1. 


2.4. Example. If X and Y are compact spaces and t: Y— X is a 
continuous map, define A: C(X)— C(Y) by (Af)(y) = f(7(y)). Then 
A €& &C(X), C(Y )) and ||A]| = 1. 


EXERCISES 
1. Show that 4(%, Y) is a Banach space if and only if Y is a Banach space. 


2. Let % be a normed space, let Y be a Banach space, and let € be the completion 
of &. Show that if p: @(%, Y) ~ B(%, Y) is defined by p(A) = A|%, then p is 
an isometric isomorphism. 


3. If (X, 2, p) is a o-finite measure space, ¢: X — F is an (2-measurable function, 
1 <p < o, and of € L?() whenever f € L?(n), then show that ¢ € L®(p). 


. Verify the statements in Example 2.2. 
. Verify the statements in Example 2.3. 


. Verify the statements in Example 2.4. 


I DW 


. Let A and 7 be as in Example 2.4. (a) Give necessary and sufficient conditions 
on 7 that A be injective. (b) Give such a condition that A be surjective. (c) Give 
such a condition that A be an isometry. (d) If X = Y, show that A* = A if and 
only if 7 is a retraction. 


§3. Finite-Dimensional Normed Spaces 


In functional analysis it it always good to see what significance a concept 
has for finite-dimensional spaces. 


3.1. Theorem. If % is a finite-dimensional vector space over F, then any two 
norms on & are equivalent. 


ProoF. Let {e,,...,e,} be a Hamel basis for 2. For x = L4_,x,e,, define 
Ix||.. = max{|x,|: 1 <j < d}. It is left to the reader to verify that || - ||, is 
a norm. Let || - || be any norm on 2%. It will be shown that || - || and || - ||, 


are equivalent. 
If x= L,x,e,, then |[x\| < Z|x,| [lel] < Cla]... when C = Y,lle,l|. To 


show the other inequality, let 7 be the topology defined on % by || - ||, 
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and let WY be the topology defined on % by ||- ||. Put B= {xE@: 
|x||., < 1}. The first part of the proof implies 7D WY. Since B is Acompact 
and 7D &Y, B is Y-compact and the relativizations of the two topologies to 
B agree. Let A = {x © 2: ||x||,, < 1}. Since A is Aopen, it is open in 
(B, WZ). Hence there is a set U in Y such that UN B = A. Thus 0 © U and 
there is an r > 0 such that {x € @: ||x|| < r} C U. Hence 


3.2 \|x|| <r and ||x||,, < l implies ||x||,, < 1. 


Claim. ||x|| < 7 implies ||x||,, < 1. 


Let ||x|| <r and put x = Lx,e,, a = ||x||,,- So ||x/a||,, = land x/a € 
B. If a >1, then ||x/al| < r/a <r, and hence ||x/al|, <1 by (3.2), a 
contradiction. Thus ||x||,, = a < 1 and the claim is established. 

By Lemma 1.4, ||x||,, <r '||x|] for all x and so the proof is complete. 


3.3. Proposition. Jf % is a normed space and is a finite dimensional 
linear manifold in %, then M is closed. 


PROOF. Let x)» © ¥\.@ and put 4, = the linear span of .Z and {x}. 
Define a norm || - ||, on 4, by ||x + a@pxXoll, = ||xl| + laol, for x in 4 
and a, in F. It is left as an exercise to show that || - ||, 1s anorm on .4,. By 
Theorem 3.1 and Proposition 1.5, there are constants c and C such that 
c\|x + apXoll < ||x]] + lal < Cllx + a oxo|| for all x in 4 and ap in F. 
Hence for all x in .Z, ||x9 — x|| => C7*(\|x|] + 1) = C7. Thus0 < C7’ < 
inf{||x 9 — x||: x € W} = dist(x9, .@). That is, every point x, not in 7 is 
at a positive distance from .@. Hence -@ is closed. i 


3.4. Proposition. Let % be a finite-dimensional normed space and let Y be 
any normed space. If T: &—> YW is a linear transformation, then T is 
continuous. 


PROOF. Since all norms on ¥ are equivalent and 7: 4-— Y is continuous 
with respect to one norm on 2 precisely when it is continuous with respect 
to any equivalent norm, we may assume that ae jel = max{|é |: 1 <7 
< d}, where {e,} is a Hamel basis for 2. Thus, for x = Léje;, ||Tx|| = 
|X € Te || < UNE | Tell < Cllx|], where C = L||Te,||. By (2.1), T is con- 
tinuous. a 


EXERCISES 
1. Show that if & is a locally compact normed space, then 2 is finite dimensional. 


2. Show that || - ||,, defined on 4, in the proof of Proposition 3.3, is a norm. 
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§4. Quotients and Products of Normed Spaces 


Let & be a normed space, let 4 be a linear manifold in %, and let Q: 
X— #/M be the natural map Ox = x + .@. We want to make 2/4 intoa 
normed space, so define 


4.1 Ix + MA = inf{||x + yl]: y |Z}. 


Note that because -@ is a linear space, ||x +-4| = inf{||x — yl: y EZ} 
= dist(x,.#), the distance from x to 4. It is left to the reader to show that 
(4.1) defines a seminorm on 2/4. But if 7 is not closed in 2, (4.1) cannot 
define a norm. (Why?) If, however, -Z is closed, then (4.1) does define a 
norm. 


4.2. Theorem. If 4 < % and ||x + M| is defined as in (4.1), then || - || is a 
norm on 2/4. Also: 


(a) ||OCx)|| < ||x|| for all x in & and hence Q is continuous. 

(b) If & is a Banach space, then so is 2/M. 

(c) A subset W of &/M is open relative to the norm if and only if Q~\(W) is 
open in &, 

(d) If U is open in &, then Q(U) is open in 2/M. 


PROOF. It is left as an exercise to show that (4.1) defines a norm on 2/4. 
To show (a), ||Q(x)|| = ||x +Z]| < ||x|| since 0 €.4; Q is therefore 
continuous by (2.1). 

(b) Let {x, +.#} be a Cauchy sequence in 2/4. There is a subsequence 
{x,, +} such that 


(Xn, +M)—-(x, +A) = 5 in ee 


N+ N+ 


Let y, = 0. Choose y, in @ such that 
Xn, — Xn + Vall S Xn, — Xn, FMI) +27 < 2-27. 
Choose y, in 4 such that 
(xn, + Y2) — (Xn, + Vail S len, — Xn, +A) +277 < 2-277, 


Continuing, there 1s a sequence { y, } in 4 such that 
I(x, te Vx) 2A Ki a Yerrll ae 


Thus {x,, + y,} is a Cauchy sequence in 2 (Why?). Since 2 is complete, 
there is an x) in 2% such that x, + y, > X 9 in %. By (a), x, + @= 
Q(x, + ¥,) 7 OX = Xy + HM. Since {x, +M} is a Cauchy sequence, 
X, +M—> x. +4 and Z/M is complete (Exercise 3). 

(c) If W is open in 2/4, then Q-'(W) is open in 2% because Q is 
continuous. Now assume that W c 2/4 and Q-'(W) is open in 2. Let 
r>0Oand put B.= {x © 2: ||x|| <r}. It will be shown that Q(B.) = {x 
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+M: \\x + M|| <r}. In fact, if ||x|| <7, then ||x +4|| < ||x|| <r. On the 
other hand, if ||x +-4@|| <r, then there isa y in @ such that ||x + yl| <r. 
Thus x +. 47= QO(x+y)e€ O(B). If x, + WE W, then x, € Q-'(W). 
Since Q~ 1(W) is open, there is an r > 0 such that xy + B, = {x: ||x — xl 
<r}cC@Q '\(W). The preceding argument now implies that W = 
OO-'(W) 2D O(x, + B) = {x +H |x -— x9 +B <r}. Hence W is 
open. 

(d) If U is open in 2%, then Q-'(Q(U))=U+ M2 {ut+y: ue€ U, 
yEM)=U+y: y © MW}. Each U + y is open, so Q-'(Q(U)) is open 
in &. By (c), O(U) is open in 2/74. a 


Because Q is an open map [part (d)], it does not follow that Q is a closed 
map (Exercise 4). 


4.3. Proposition. Jf 2 is a normed space, M<%, and WN is a finite 
dimensional subspace of &, then M+ N is a closed subspace of 2X. 


Proor. Consider 2/M@ and the quotient map Q: 2-2/4. Since 
dimQ(V) < dim V< o, QO(W) is closed in &/@. Since Q is continuous 
O~'(O(/)) is closed in ; butO°'(O(VY)) =AW+N. 


Now for the product or direct sum of normed spaces. Here there is a 
difficulty because, unlike Hilbert space, there is no canonical way to 
proceed. Suppose { 2: i € J} is a collection of normed spaces. Then [I{ 2;: 
i © I} is a vector space if the linear operations are defined coordinatewise. 
The idea is to put a norm on a linear subspace of this product. 

Let || - || denote the norm on each 2;. For 1 < p < 00, define 


©, 2, = f eT 12;: Ix = Eaxcine] < a 
Define 
@,,£, = {x eT 12; [all = suplix(i)|| < 00}. 
If {2%,, %, ...} 1s a sequence of normed spaces, define 
G.2.= f E I, x(n) > oh 


give ®,%, the norm it has as a subspace of ©, 7, 
The proof of the next proposition is left as an exercise. 


4.4. Proposition. Let {2%,: i < I} be a collection of normed spaces and let 
= 0,4, 1 5p 5 w. 


(a) & is a normed space and the projection P,;; & > 4, is a continuous linear 
map with ||P,(x)|| < ||x|| for each x in &. 


III.4. Quotients and Products of Normed Spaces ie 


(b) & is a Banach space if and only if each 2, is a Banach space. 


I 


(c) Each projection P, is an open map of Z onto &,. 


A similar result holds for ®,%,, but the formulation and proof of this is 


left to the reader. 


EXERCISES 


1. 
2: 


10. 


ll. 


12. 


13. 


14. 


Show that if JW < %, then (4.1) defines a norm on 2/%. 


Prove that 2 is a Banach space if and only if whenever { x, } is a sequence in & 
such that )'||x,|| < oo, then L°°_, x, converges in Z. 


. Show that if (X,d) is a metric space and { x,,} is a Cauchy sequence such that 


there is a subsequence {x,, } that converges to x9, then x, > Xo. 


. Find a Banach space 2% and a closed subspace -@ such that the natural map Q: 


4 — 4/M is not a closed map. Can the natural map ever be a closed map? 


. Prove the converse of (4.2b): If % is a normed space, 4 < #, and both .@ and 


4/M are complete, then % is complete. (This is an example of what is called a 
“two-out-of-three” result. If any two of 2, 4, and &/@ are complete, so is the 
third.) 


. Let M={xeEl?’: xQn)=0 for all n}, 1 < p < oo. Show that /?/7@ is 


isometrically isomorphic to /?. 


. Let X be a normal locally compact space and F a closed subset of X. If 


M= {fe G(X): f(x) = 0 for all x in F}, then C(X)/Z is isometrically 
isomorphic to Cy(F). 


. Prove Proposition 4.4. 


. Formulate and prove a version of Proposition 4.4 for ©, ,,. 


If {2,,...,%,,} is a finite collection of normed spaces and 1 < p < o, show 
that the norms on © 2, are all equivalent. 


Here is an abstraction of Proposition 4.4. Suppose {%;: i © J} is a collection of 
normed spaces and Y is a normed space contained in F’. Define #= {x € 11,2: 
there is a y in Y with ||x(i)|| < y(/) for all i}. If x © %, define ||x|| = inf{||y||: 
l|x(i)|| < y() for all i}. Then (2, || - ||) is a normed space. Give necessary and 
sufficient conditions on Y that each of the parts of (4.4) are valid for %. 


Let 2 be a normed space and 4 < @. (a) If % is separable, so is %/%@. (b) If 
4/M and M are separable, then % is separable. (c) Give an example such that 
4/M is separable but 2% is not. 


Let 1 < p < oo and put ¥= © 4. Show that # is separable if and only if J is 
countable and each %, is separable. Show that ©, 2%; is separable if and only 
if J is finite and each %, is separable. 


Show that ©,2%,, is separable if and only if each %, is separable. 
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15. Let JCJ, and = OF: i€ 1}, M= {x EX: x(j)=0 for j in J}. 
Show that 2/M is isometrically isomorphic to © ,{%;: j € J}. 


16. Let # be a Hilbert space and suppose 4 < #. Show that if 0: #- H/M is 
the natural map, then 0: 4* — #H/M is an isometric isomorphism. 


§5. Linear Functionals 


Let & be a vector space over F. A hyperplane in % is a linear manifold 4 
in & such that dim( 2/7) = 1. If f: #- F is a linear functional and 
f #0, then kerf is a hyperplane. In fact, f induces an isomorphism 
between %/ker f and F. Conversely, if WZ is a hyperplane, let 0: 2 —> 2/4 
be the natural map and let 7: 2/,@-— F be an isomorphism. Then 
f = T°Q isa linear functional on 2 and kerf =-Z. 

Suppose now that f and g are linear functionals on 2% such that 
ker f = kerg. Let x) € & such that f(x.) = 1; so g(x,) # 0. If x © and 
a = f(x), then x — ax, € kerf = kerg. So 0 = g(x) — ag(x 9), or g(x) = 
(g(X,))a = (g(X_)) f(x). Thus g = Bf for a scalar B. This is summarized as 
follows. 


5.1. Proposition. A linear manifold in & is a hyperplane if and only if it is 
the kernel of a linear functional. Two linear functionals have the same kernel if 
and only if one is a nonzero multiple of the other. 


Hyperplanes in a normed space fall into one of two categories. 


5.2. Proposition. Jf % is a normed space and M is a hyperplane in 2, then 
either M is closed or M is dense. 


Proor. Consider cl.@, the closure of .4. By Proposition 1.3, cl.# is a 
linear manifold in %. Since 4 C cl.@ and dim 2/4 = 1, either cl Z= 4 
orcl H= 2. a 


If = c, and f: 4-—F is defined by f(a,, a), ...) = a,, then kerf = 
{(a,,) © Co: a, = 0} is closed in cy. To get an example of a dense hyper- 
plane, let 2 = c, and let e, be the element of cy such that e,(k) = 0 if 
k #n and e,(n) = 1. (It is best to think of cy, as a collection of functions 
on N.) Let x)(7) = 1/n for all n; so x9 € cy and {Xo,e),@5,...} 18 a 
linearly independent set in cy. Let # = a Hamel basis in cy which contains 
{X,€1,€5,...}. Put B= (Xo, €1,€5,..-} U{b: ie I}, b, # xq or e,, for 
any i or n. Define f: cy > F by f(agpxo + Lo_,a,e, + X;8;b;) = ao. (Re- 
member that in the preceding expression at most a finite number of the a, 
and £8, are not zero.) Since e, € kerf for all n > 1, kerf is dense but 
clearly ker f # co. 
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The dichotomy that exists for hyperplanes should be reflected in a 
dichotomy for linear functionals. 


5.3. Theorem. Jf 2 is a normed space and f: 2 — F is a linear functional, 
then f is continuous if and only if ker f is closed. 


Proor. If f is continuous, ker f = f ‘({0}) and so kerf must be closed. 
Assume now that kerf is closed and let 0: 2— 2/kerf be the natural 
map. By (4.2), Q is continuous. Let 7: %/ker f — F be an isomorphism; by 
(3.4), T is continuous. Thus, if g = T°Q: 2-F, g is continuous and 
ker f = ker g. Hence (5.1) f = ag for some a in F and so f is continuous. 

2 


If f: &— F 1s a linear functional, then f is a linear transformation and 
so Proposition 2.1 applies. Continuous linear functionals are also called 
bounded linear functionals and 


fll = sup {If(x)I: [xl] < 1}. 


The other formulas for ||/|| given in (2.1) are also valid here. Let 2 * = the 
collection of all bounded linear functionals on 2. If f,g <= %* anda eé F, 
define (af + g\(x) = af(x)+ g(x); #* is called the dual space of 2. 
Note that 2 * = &(Z,F). 


5.4. Proposition. Jf % is a normed space, * is a Banach space. 


PROOF. It is left as an exercise for the reader to show that 2* is a normed 
space. To show that 2* is complete, lett B= {x E22: ||x|| < 1}. If 
fEeX*, define p(f): BF by p(f)(x)= f(x); that is, p(f) is the 
restriction of f to B. Note that p: #* — C,(B) is a linear isometry. Thus 
to show that 2* is complete, it suffices, since C,(B) is complete (1.6), to 
show that p(%*) is closed. Let {f,} C 2* and suppose g © C,(B) such 
that ||o(f,) — g|| ~ 0 as n> oo. Let x © %. If a,B EF, a, B #0, such 
that ax, Bx © B, then a ‘g(ax) = lima 'f,(ax) = lim B UL,(Bx) = 
B~‘g( Bx). Define f: £—> F by letting f(x) = a7 'g(ax) for any a # 0 such 
that ax © B. It is left as an exercise for the reader to show that fe %* 
and p(f)=g. & 


Compare the preceding result with Exercise 2.1. 

It should be emphasized that it is not assumed in the preceding proposi- 
tion that & is complete. In fact, if % is a normed space and & is its 
completion (Exercise 1.16), then 2* and %* are isometrically isomorphic 
(Exercise 2.2). 


5.5. Theorem. Let (X,2,p) be a measure space and let 1<p< oo. If 
1/p + 1/q =1andg € L"(X,9,u), define F,: L?(w) > F by 


F,(f) = ffadp. 
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Then F, © L?()* and the map g > F, defines an isometric isomorphism of 
L4(p) onto L?(p)*. 


Since this theorem is often proved in courses in measure and integration, 
the proof of this result, as well as the next two, is contained in the 
Appendix. See Appendix B for the proofs of (5.5) and (5.6). 


5.6. Theorem. Jf (X, 2, w) is a o-finite measure space and g © L®(X, M2, L), 
define F,: L'(w) > F by 


F,(f) = {fgdu. 


Then F, © L(u)* and the map g > F,, defines an isometric isomorphism of 
L®©() onto L'(p)*. 


Note that when p = 2 in Theorem 5.5, there is a little difference between 
(5.5) and (1.3.5) owing to the absence of a complex conjugate in (5.5). Also, 
note that (5.6) is false if the measure space is not assumed to be o-finite 
(Exercise 3). 

If X is a locally compact space, M(X) denotes the space of all F-valued 
regular Borel measures on X with the total variation norm. See Appendix C 
for the definitions as well as the proof of the next theorem. 


5.7. Riesz Representation Theorem. Jf X is a locally compact space and 
© M(X), define F,: Co(X) > F by 


Ff) = [ fap. 


Then F, € C,(X)* and the map > F, is an isometric isomorphism of 
M(X) onto C,(X)*. 


There are special cases of these theorems that deserve to be pointed out. 


5.8. Example. The dual of cy is isometrically isomorphic to /'. In fact, 
Cy = C,(N), if N is given the discrete topology, and /* = M(N). 


5.9. Example. The dual of /’ is isometrically isomorphic to /*”. In fact, 
J} = L1(N,2%, w), where »(A) = the number of points in A. Also, /* = 
L~(N, 2%, w). 


5.10. Example. If 1 < p < oo, the dual of /” is /%, where 1 = 1/p + 1/4. 


What is the dual of L*(X, §2, 1)? There are two possible representations. 
One is to identify L°(X, Q, w)* with the space of finitely additive measures 
defined on Q that are “absolutely continuous” with respect to p and have 
finite total variation (see Dunford and Schwartz [1958], p. 296). Another 
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representation is to obtain a compact space Z such that L’(X,, p) is 
isometrically isomorphic to C(Z) and then use the Riesz Representation 
Theorem. This will be done later in this book (VIII.2.1). 

What is the dual of M(X)? For this, define L*( M(X)) as the set of all F 
in T1{ L°(p): wp € MCX)} such that if p< vp; then F(p) = F(v) ae. [p]. 
This is an inverse limit of the spaces L®(w), p in MCX). 


5.11. Lemma. If F © L®(M(X)), then 


Fil = sup||F(p)Il. < 0. 
iv 
PROOF. If ||F|] = 00, then there 1s a sequence {p,} in M(X) such that 
Fe. Zn. Let w= 22° "|p, |/llvll Then w, <p for all n, so 
F(u,,) = F(w) ae. [p,] for each n. Hence |/F(H)llo. =F Uwlloo 2” for 
each n, a contradiction. a 


5.12. Theorem. Jf X is locally compact and F © L®(M(X)), define ®,: 
M(X) > F by 


®,() = [F(u) dp. 


Then ®, © M(X)* and the map F > ©, is an isometric isomorphism of 
L®(M(X)) onto M(X)*. 


PROOF. It is easy to see that ®, is linear. Also, |®,(p)| < /|F(p)| dlp] < 
FCH)Ileollall < [Fil (lull. Thus ©, € M(X)* and ||®;|| < {IFIl. 

Now fix ® in M(X)*. If » © M(X) and fe L'(\p\), then v = fu € 
M(X). (That is, v(A) = {, fdu for every Borel set A.) Also ||p|| = /{f| aly. 
In fact, the Radon—Nikodym Theorem can be interpreted as an identifica- 
tion (isometrically isomorphic) of L'(|u|) with {n € M(X): 1 < |p|}. Thus 
f > (fp) is a linear functional on L'(\p|) and |®(fp)| < ||P f|fl dim. 
Hence there is an F(p) in L*(|u|) such that ®( fu) = {fF(p) dp for every f 
in L\(|p|) and ||F(z)||,, < ||O||. (We have been a little nonchalant about 
using p or |p|, but what was said is perfectly correct. Fill in the details.) In 
particular, taking f=1 gives ®(n) = /F(p) dp. It must be shown that 
F € L™(M(X)); it then follows that ® = ®,. and ||®,1| = ||F\i,. 

To show that F © L®(M(X)), let » and v be measures such that pp « »p. 
By the Radon—Nikodym Theorem, there is an f in L'(|u|) such that v = fw. 
Hence if g © L(|r|), then gf © L(\ul) and fgdv = {gfdu. Thus, 
[gF(v) dv = O(gv) = O(gfu) = fafF(u) dp = fgF(u)dv. So F(v) = F(u) 
a.e.[v] and F © L*(M(X)). lis 


EXERCISES 
1. Complete the proof of Proposition 5.4. 


2. Show that #* is a normed space. 
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3. Give an example of a measure space (X, 22,1) that is not o-finite for which the 
conclusion of Theorem 5.6 is false. 


4. Let {%,: i © T} be a collection of normed spaces. If 1 < p < 00, show that the 
dual space of @ 2, . is isometrically isomorphic to @ 2%, where 1/p + 1/qg = 1. 


t 


5. If 2, %5, ... are normed spaces, show that (®,2%,,)* is isometrically isomorphic 
to © 4%. 

6. Let n >1 and let C‘”[0,1] be defined as in Example 1.10. Show that ||f|| = 
vr dif (O)| + sup{ if (x)|: 0 < x < 1} is an equivalent norm on C‘” (0, 1]. 
Show that L € (C‘”[0,1])* if and only if there are scalars ay,a,,...,@,,_, and 
a measure p on [0,1] such that L(f) = Li tja, f% (0) + ff du. Is there a 
formula for ||Z|| in terms of Jao], |a,|,---,]a,_,|, and |||}? 


§6. The Hahn-—Banach Theorem 


The Hahn—Banach Theorem is one of the most important results in 
mathematics. It is used so often it is rightly considered as a cornerstone of 
functional analysis. It is one of those theorems that when it or one of its 
immediate consequences is used, it is used without quotation or reference 
and the reader is assumed to realize that it is being invoked. 


6.1. Definition. If & is a vector space, a sublinear functional 1s a function 
gq: &— R such that 


(a) g(x + y) < q(x) + q(y) for all x, y in Z; 
(b) g(ax) = aq(x) for x in 2 and a > 0. 


Note that every seminorm is a sublinear functional, but not conversely. In 
fact, it should be emphasized that a sublinear functional is allowed to 
assume negative values and that (b) in the definition only holds for a > 0. 


6.2. The Hahn—Banach Theorem. Let 2 be a vector space over R and let q 
be a sublinear functional on 2. If 4 is a linear manifold in & and f: MH — R 
is a linear functional such that f(x) < q(x) for all x in , then there is a 
linear functional F: &— R such that F| 4 = f and F(x) < q(x) for all x in 
g. 


Note that the substance of the theorem is not that the extension exists but 
that an extension can be found that remains dominated by q. Just to find an 
extension, let {e,;} be a Hamel basis for / and let { y,} be vectors in % 
such that {e,} U { y,} is a Hamel basis for 2. Now define F. 2 —> R by 
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F(X ,a,e; + UB) y;) = Lia; f(e;) = f(v,a,e;). This extends f. If {y,} is any 
collection of real numbers, then F(2;a,;e; + LB, y,) = f(U,a,e;) + UB; is 
also an extension of f. Moreover, any extension of f has this form. The 
difficulty is that we must find one of these extensions that is dominated 
by q. 

Before proving the theorem, let’s see some of its immediate corollaries. 
The first 1s an extension of the theorem to complex spaces. For this a lemma 
is needed. Note that if 2 is a vector space over C, it is also a vector space 
over R. Also, if f: 2— C is C-linear, then Ref: #— R is R-linear. The 
following lemma is the converse of this. 


6.3. Lemma. Let % be a vector space over C. 


(a) If f: £>R is an R-linear functional, then f(x) = f(x) — if(ix) is a 
C-linear functional and f = Ref. 

(b) If g: ¥— C is C-linear, f = Reg, and f is defined as in (a), then f = g. 

(c) If p is a seminorm on & and f and f are as in (a), then |f(x)| < p(x) for 
all x if and only if |f(x)| < p(x) for all x. 

(d) If & is a normed space and f and f are as in (a), then |\f\| = |\f\\. 


PROOF. The proofs of (a) and (b) are left as an exercise. To prove (c), 
suppose |f(x)| < p(x). Then f(x) = Re f(x) < |f(x)| < p(x). Also, 
— f(x) = Ref(—x) < |f(—x)| < p(x). Hence |f(x)| < p(x). Now assume 
that |f(x)| < p(x). Choose @ such that f(x) = e'®| f(x)|. Hence If(x)| = 
flex) = Ref(e~*x) = flex) < p(e x) = p(x). 

Part (d) is an easy application of (c). a 


6.4. Corollary. Let 2 be a vector space, let M be a linear manifold in &, 
and let p: & — [0, 00) be a seminorm. If f: M — F is a linear functional such 
that |f(x)| < p(x) for all x in M, then there is a linear functional F: > ¥ 
such that F| M = f and |F(x)| < p(x) for all x in &. 


PROOF. Case 1: F = R. Note that f(x) < |f(x)| < p(x) for x in . By 
(5.2) there is an extension F: 2— R of f such that F(x) < p(x) for all x. 
Hence — F(x) = F(—x) < p(—x) = p(x). Thus |F| < p. 

Case 2: F = C. Let f, = Ref. By (6.3c), |f,| < p. By Case 1, there is an 
R-linear functional F;: 2—R such that F,|W=/f, and |F,| < p. Let 
F(x) = F\(x) — iF,(ix) for all x in 2. By (6.3c), |F| < p. Clearly, F\.4 = f. 

a 


6.5. Corollary. Jf % is a normed space, is a linear manifold in &, and f: 
M — F is a bounded linear functional, then there is an F in &* such that 
F\M = f and ||F\| = |lf\l.- 


PROOF. Use Corollary 6.4 with p(x) = ||Al| ||x]|. | 
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6.6. Corollary. If 2 is a normed space, {X,,X5,...,Xq4} is a linearly 
independent subset of 2, and a,,0,,...,@, are arbitrary scalars, then there 
isan fin &* such that f(x;)= a; forl <j <d. 


Proor. Let .@ = the linear span of x,,...,x, and define g: “4 —F by 
g(u,B.x;) = u,B,a;. So g is linear. Since @ is finite dimensional, g is 
continuous. Let f be a continuous extension of g to &. a] 


6.7. Corollary. Jf & is a normed space and x © 2, then 


lIxl| = sup{If(x)|: fe * and |\fi| < 1}. 
Moreover, this supremum is attained. 


Proor. Let a = sup{|f(x)|: fe 4* and ||f]| < 1}. ff fe %* and |[fl| < 1, 
then |f(x)| < |[All |x|] < [|x|]; hence a < ||x||. Now let Z= { Bx: B © F} 
define g: @->F by g(Bx)= Bilx||. Then g &.@* and ||g|| =1. By 
Corollary 6.5, there is an f in 2* such that ||fl| = 1 and f(x) = g(x) = 
Pd | 


This introduces a certain symmetry in the definitions of the norms in % 
and &* that will be explored later (§11). 


6.8. Corollary. If % is a normed space, H<%, x5 ©€X\M, and d= 
dist(x,), 4), then there is an fin &* such that f(x.) = 1, f(x) = 0 for all x 
in M, and ||f\| = d7'. 


Proor. Let 0: &— Z/M be the natural map. Since ||x, + 4|| = d, by the 
preceding corollary there is a g in (2%/@)* such that g(x, + #) =d and 
\|g|| = 1. Let f = d~'g°Q: #— F. Then f is continuous, f(x) = 0 for x in 
M, and f(x) = 1. Also, |f(x)| = 4~*|g(Q(x))| sd "]O(x)I] s aI); 
hence ||/|| < d~'. On the other hand, ||g|| = 1 so there is a sequence { x, } 
such that |g(x, +-4@)| — 1 and ||x, +-4@|| < 1 for all n. Let y, © @ such 
that ||x, + y,l| <1. Then [f(x, + y,)] = d'Ig(x, + | > d-4, so |Ifll = 
dq’. 


To prove the Hahn—Banach Theorem, we first show that we can extend 
the functional to a space of one dimension more. 


6.9. Lemma. Suppose the hypothesis of (6.2) is satisfied and, in addition, 
dim 2/4 = 1. Then the conclusion of (6.2) is valid. 


PROOF. Fix x) in 2\.4; soF=AMV {xo} ={txyo ty tEeR, ye}. 
For the moment assume that the extension F: 2—R of f exists with 
F < q. Let’s see what F must look like. Put a) = F(x,). If t > 0 and 
y, € 4, then F(tx) + y,) = tay + f(,) S G(tXp + y,). Hence ay < 
tf (Wy) + 0 'g(tXo + V1) = —fO1/1) + G(X + ¥1/t) for every yy in 
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M. Since y,/t © 4%, this gives that 


6.10 a < —f(y,) + (xo +1) 


for all y, in 4. Also note that if a satisfies (6.10), then by reversing the 
inequalities in the preceding argument, it follows that ta, + f(y,) < q(txy 
+ y,) whenever ¢ > 0. 

If t>Oand y, © @ andif Fexists, then F(—tx,) + y,) = —tay + f()>) 
< q(—tx, + y,). As above, this implies that 


6.11 ay = f(y) — a(x + V2) 


for all y, in 4. Moreover, (6.11) is sufficient that —tay) + f()») < q(—txo 
+ y,) for all t > 0 and y, in %Z. 

Combining (6.10) and (6.11) we see that we must show that a, can be 
chosen satisfying (6.10) and (6.11) simultaneously. Thus we must show that 


6.12 f (2) — W(~ x0 + ¥2) S$ —fC1) + (Xo +1) 
for all y,, y> in 4. But this means we want to show that f(y, + y) < q(Xo 
+ 1) + Q(— Xo + yz). But 
f(y. +2) S$ g(1 + 2) = q((1 + x9) +(—X9 + y,)) 

< q(y, + Xo) + a(—Xo + Wn), 
so (6.12) is satisfied. If a) is chosen with sup{ f(y.) — q(—X 9 + yp): 
yy © M} Say < nf(—f(y)) + Q(X + yy): yy EM} and F(txy + y) = 
ta, + f(y,), F satisfies the conclusion of (6.2). | 


PROOF OF THE HAHN—BANACH THEOREM. Let # be the collection of all 
pairs (.4,, f,), where @, is a linear manifold in % such that 4, DW and 
fi: 4, > R is a linear functional with fj|W%=/f and f, <q on -4,. If 
(4,, f,) and (.4,, f,) © F, define (.4,, f,) s (4, f,) to mean that -Z, 
CM, and f,|4,=f,. So (%, <) is a partially ordered set. Suppose 
= {(4,, f,: ie IT} isachainin #. If W= ULW;: ie I}, then the fact 
that @ is a chain implies that / is a linear manifold. Define F: W— R by 
setting F(x) = f,(x) if x © Z,. It is easily checked that F is well defined, 
linear, and satisfies F < g on VY. So(%V,F)€¥F and (%, F) is an upper 
bound for @. By Zorn’s Lemma, £ has a maximal element (¥Y, F). But the 
preceding lemma implies that “= 2%. Hence F is the desired extension. 

z 


This section concludes with one important consequence of the Hahn- 
Banach Theorem. It will be generalized later (IV.3.11), but it is used so often 
it is worth singling out for consideration. 


6.13. Theorem. Jf 2 is a normed space and 4 is a linear manifold in &, 
then 


cl W =[(\{kerf: fe X* and MC kerf }. 
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ProoF. Let W= (kerf: fe %* and WC kerf}. lf feR* and We 
ker f, then the continuity of f implies that cl.@C kerf. Hence cl ZC YN. 
If x, € cl. 4, then d = dist(x9,.#) > 0. By Corollary 6.8 there is an f in 
#* such that f(x.) = 1 and f(x) = 0 for every x in 4. Hence x) EV. 
Thus “Cc cl.@ and the proof is complete. a 


6.14. Corollary. If % is a normed space and is a linear manifold in 2, 
then M is dense in & if and only if the only bounded linear functional on & 
that annihilates is the zero functional. 


EXERCISES 
1. Complete the proof of Lemma 6.3. 
2. Give the details of the proof of Corollary 6.5. 


3. Show that c* is isometrically isomorphic to /'. Are c and cy isometrically 
isomorphic? 


4. If » is a measure on [0,1] and {x" du(x) = 0 for all n > 0, show that p = 0. 


5. If n > 1, show that there is a measure p on [0,1] such that for every polynomial 
p of degree at most n, 


[rdw =X P(k/n). 

k=1 

6. If n > 1, does there exist a measure p on [0,1] such that p’(0) = {pdp for every 
polynomial of degree at most n? 


7. Does there exist a measure p on [0,1] such that {pdp = p’(0) for every 
polynomial p? 


8. Let K be a compact subset of C and define A(K) to be {fe C(K): f is 
analytic on int K }. (Functions here are complex valued.) Show that if a € K, 
then there is a probability measure » supported on 0K such that f(a) = /fdp 
for every f in A(K). (A probability measure is a nonnegative measure p such 
that [ull = 1.) 


9. If K=clD (D = {|z| < 1}) and ae K, find the measure » whose existence 
was proved in Exercise 8. 


10. Let P = { p|OD: p = an analytic polynomial} and consider P as a manifold in 
C( dD). Show that if u is a real-valued measure on 0D such that {pd = 0 for 
every p in P, then p = 0. Give an example of a complex-valued measure p such 
that » + 0 but {pdp = 0 for every p in P. 


§7*. An Application: Banach Limits 
If x = {x(n)} € c, define L(x) = lim x(n). Then L is a linear functional, 


\|Z|| = 1, and, if for x in c, x’ is defined by x’ = (x(2), x(3), ...), then 
L(x) = L(x’). Also, if x > 0 [that is, x(n) => 0 for all nj, then L(x) > 0. 
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In this section it will be shown that these properties of the limit functional 
can be extended to /~. The proof uses the Hahn-—Banach Theorem. 


7.1. Theorem. There is a linear functional L: 1° — F such that 


(a) ||L|| = 1. 

(b) [fx Ec, L(x) = limx(n). 

(c) Ifx = 1° and x(n) = 0 for all n, then L(x) = 0. 

(d) Ifx €1™ and x’ = (x(2), x(3), ...), then L(x) = L(x’). 


PROOF. First assume F = R; that is, 1° = /p. If x € /™, let x’ denote the 
element of /° defined in part (d) above. Put 4 = {x — x’: x © 1°}. Note 
that (x + ay)’ = x’+ ay’ for any x, y in /* and a in R; hence 4 isa 
linear manifold in /*. Let 1 denote the sequence (1,1,1, ...) in /®. 


7.2. Claim. dist(1,.#) = 1. 


Since 0 € 4, dist(1,.H) < 1. Let x € 1”; if (x — x’\(n) < 0 for any n, 
then |/1 — (x — x’)||,, = [1 -— (x() — x(n))| = 1. Suppose 0 < (x - 
x’\(n) = x(n) — x(n) = x(n) — x(n + 1) for all n. Thus x(n + 1) < x(n) 
for all n. Since x € /”, a = lim x(n) exists. Thus lim(x — x’)(n) = 0 and 
|1 — (x — x’)||,, = 1. This proves the claim. 

By Corollary 6.8 there is a linear functional L: /* — R such that 
||Z||} = 1, LG) = 1, and L(.@) = 0. So this functional satisfies (a) and (d) 
of the theorem. To prove (b), we establish the following. 


7.3. Claim. cy € ker L. 


If x Eco, let x =x’ and let x7) =(x™)’ for n> 1. Note that 


xP) — y= [xD — yM] 4 --- +x’ — x] © WM. Hence L(x) = 
L(x) for all n> 1. If ¢ > 0, then let n be such that |x(m)| <e for 
m>n. Hence |L(x)| = |L(x™)| < |[x?||,, = sup{|x(m)|: m>n}<e. 


Thus x € ker L. Condition (b) is now clear. 

To show (c), suppose there is an x in /® such that x(n) > 0 for all n and 
L(x) < 0. If x is replaced by x/||x||,, it remains true that L(x) < 0 and it 
is also true that 1 > x(n)>0 for all n. But then |j/1 — x||,, <1 and 
LO — x)=1-— L(x) > 1, contradicting (a). Thus (c) holds. 

Now assume that F = C. Let L, be the functional obtained on /p. If 
xeE/le, then x =x, + ix, when x,,x, €&/R. Define L(x) = L,(x,) + 
iL ,(x,). It is left as an exercise to show that L is C-linear. It’s clear that (b), 
(c), and (d) hold. It remains to show that ||Z]|| = 1. 

Let E,,..., £,, be pairwise disjoint subsets of N and let a,,...,a,,€ C 
with ja,| <1 for all k. Put x = Lf_ja,xz 5 sox Ee? and ||x]|,, < 1. 
Then L(x) = Lya,L(xe,) = LyexLi(xe,). But Li(xe,) = 0 and 
Ly Li(Xe,) = Li(Xe), where E =U,E,. Hence X,L,(xz,) < 1. Because 
|a,| < 1 for all k, |L(x)| < 1. (This is a small convexity argument.) If x is 
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an arbitrary element of /°, ||x||,, < 1, then there is a sequence {x,,} if 
elements of /~ such that ||x, — x||,, > 0, ||x,||,, < 1, and each x, is the 
type of element of /* just discussed that takes on only a finite number of 
values (Exercise 3). Clearly, ||Z|| < 2, so L(x,,) > L(x). Since |L(x,,)| < 1 
for all n, |L(x)| < 1. Hence |jL]| < 1. Since LQ) = 1, |/L]} = 1. a 


A linear functional of the type described in Theorem 7.1 1s called a 
Banach limit. They are useful for a variety of things, among which is the 
construction of representations of the algebra of bounded operators on a 
Hilbert space. 


EXERCISES 


1. If L is a Banach limit, show that there are x and y in /© such that L(xy) # 
L(x)L(y). 


2. Let X be a set and Q a o-algebra of subsets of X. Suppose p is a complex-valued 
countably additive measure defined on (2 such that |||| = u(X) < oo. Show that 
u(A) = 0 for every A in 22. (Though it is difficult to see at this moment, this fact 
is related to the proof of (c) in Theorem 7.1 for the complex case.) 


3. Show that if x € /™, ||x||,, <1, then there is a sequence {x,,}, x, in /° such 
that ||x,||,, <1, ||x, — xl] ~ 0, and each x, takes on only a finite number of 
values. 


§8*. An Application: Runge’s Theorem 


8.1. Runge’s Theorem. Let K be a compact subset of C and let E be a subset 
of C,,\ K that meets each component of C,,\K. If f is analytic in a 
neighborhood of K, then there are rational functions f,, whose only poles lie in 
E such that f,, ~ f uniformly on K. 


The main tool in proving Runge’s Theorem is Theorem 6.13. (A proof 
that does not use functional analysis can be found on p. 198 of Conway 
[1978].) To do this, let R(K, £) be the closure in the space C(K ) of the 
rational functions with poles in E. By (6.13) and the Riesz Representation 
Theorem, it suffices to show that if » © M(K) and /gdu = 0 for each g in 
R(K, E), then ffdp = 0. 

Let R > O and let A be area measure. Pick p > 0 such that B(O; R) Cc 
B(z; ep) for every z in K. Then for z in K, 


i] lz — w|dd(w) < | iz —w] 'ddA(w) 
B(O; R) B(z;p) 


= [Cf aa = 270. 
0 “0 
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If uw © M(K), define fi: C — [0, co] by 


ii(w) _ d\u|(z) 


Iz — w 
when the integral is finite, and fi(w) = oo otherwise. The inequalities above 
imply (2) 
- | 
fo wlw)drw) = fof TERS an(w) 
B(0; R) B(O; R)YK |Z 
ae 
= d 
ia Ry I2 — w{ ml ) 


< 27||hI. 
Thus fi(w) < 00 ae. [A]. 


8.2. Lemma. Jf p © M(K), then 


ji(w) = j= 


w 
is in L'(B(0O; R), A) for any R > 0, fi is analytic on C ,, \ K, and ji(oo) = 0. 


PROOF. The first statement follows from what came before the statement of 
this lemma. To show that f is analytic on C,, \ K, let w,w) © C \ K and 


note that 
iw) — Blow) dz) 


WoW Ie (z—w)(z—m%) 


AS w > Wp, [(z — w)\(z — w)]7 | > (z — Ww)? uniformly for z in K, so 
that ~ has a derivative at w, and 


Se (ovo) = f (2 = 0)? du (2). 


So fi is analytic on C \ K. To show that it is analytic at infinity, note that 
f(z) > 0 as z > oo, So infinity is a removable singularity. z 


It is not difficult to see that for w) nC \ K, 
d t —n-1 
ean = yl eae 
8.3 a (wo) a { (2 wo) du(z) 
Also, we can easily find the power series expansion of fi at infinity. Indeed, 
: 1 1 2 
ji(w) Sif du(z) = — f(t = =| du(z). 


Choose w near enough to infinity that |z/w| < 1 for all z in K. Then 


8. iw) = = ¥ f(Z) ante) 


where a, = {z"du(z). 
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Now assume p © M(K) and {gdp = 0 for every rational function g with 
poles in E. Let U be a component of C, \ K, and let we EN U. If 
Wy # 00, then the hypothesis and (8.3) implies each derivative of f at we 
vanishes. Hence fi = 0 on U. If w. = oo, then (8.4) implies f = 0 on U. Thus 
p=OonC. \K. 

If f is analytic on an open set G containing K, let y,,...,y, be 
straight-line segments in G\ K oe that 


f(2)= ¥ ial oa 


for all z in K. (See p. 195 of ca [1978].) Thus 
1 f(w) 
J f(2) aul) a oe Bee dw du(z) 
n 1 
— L 55 f fl)a(») aw 
k=1 Yk 


by Fubini’s Theorem. But f(w)=0 on y, (C C\K), so ffdp = 0. By 
(6.13), f © R(K, E). This proves Runge’s Theorem. a 


| ai 


8.5. Corollary. Jf K is compact and C \ K is connected and if f is analytic in 
a neighborhood of K, then there is a sequence of polynomials that converges to 
f uniformly on K. 


EXERCISES 


1. Let » be a compactly supported measure on C that is absolutely continuous with 
respect to area measure. Show that fi is continuous on C, . 


2. Let m = Lebesgue measure on [0,1]. Show that m is not continuous at any point 
of [0,1]. 


§9*. An Application: Ordered Vector Spaces 


In this section only vector spaces over R are considered. 

There are numerous spaces in which there is a notion of < in addition to 
the vector space structure. The L? spaces and C( X) are some that spring to 
mind. The concept of an ordered vector space is an attempt to study such 
spaces in an abstract setting. The first step is to abstract the notion of the 
positive elements. 


9.1. Definition. An ordered vector space is a pair (2, <) where 4 is a 
vector space over R and < isa relation on & satisfying 


(a) x < x for all x; 
(b) if x < y and y <z, then x <z; 
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(c) ifx <yandze@,thenx+z<y+t+z; 
(d) if x < y and a € [0, 00), then ax < ay. 


Note that it is not assumed that < 1s antisymmetric. That is, it is not 
assumed that if x < y and y < x, then x = y. 


9.2. Definition. If Z is a real vector space, a wedge is a nonempty subset P 
of & such that 


(a) ifx,y & P, then x +yeEP; 
(b) if x € P and a € (0, 00), then ax € P. 


9.3. Proposition. (a) If (2, <) is an ordered vector space and P = {x © &: 
x > 0}, then P is a wedge. (b) If P is a wedge in the real vector space ® and 
< is defined on & by declaring x < y if and only if y — x € P, then (2%, <) 
is an ordered vector space. 


PROOF. Exercise. 


If (2, <) is an ordered vector space, P = {x € : x > 0} 1s called the 
wedge of positive elements. The next result is also left as an exercise. 


9.4. Proposition. /f (2%, <) is an ordered vector space and P is the wedge of 
positive elements, < is antisymmetric if and only if P 0 (—P) = (0). 


9.5. Definition. A cone in % is a wedge P such that P M (—P) = (0). 


9.6. Definition. If (2%, <) is an ordered vector space, a subset A of % is 
cofinal if for every x > 0 in & there is an a in A such that a> x. An 
element e of Z is an order unit if for every x in 2 there is a positive integer 
n such that —ne < x < ne. 


If X is a compact space and #= C(X), then any constant function is an 
order unit. (f < g if and only if f(x) < g(x) for all x). If #= C(R), all 
real-valued continuous functions on R, then 2 has no order unit (Exercise 
4). If e is an order unit, then {ne: n > 1} is cofinal. 


9.7. Definition. If (2, <) and (Y, <) are ordered vector spaces and T: 
4 — Y is a linear map, then T is positive (in symbols T > 0) if Tx > 0 
whenever x > 0. 


The principal result of this section is the following. 


9.8. Theorem. Let (2%, <) be an ordered vector space and let Y be a linear 
manifold in & that is cofinal. If f: Y— R is a positive linear functional, then 
there is a positive linear functional f: & — R such that f|Y= f. 


PRooF. Let P= {x © #: x > 0} and put 2, = Y+ P — P. It is easy to 
see that 2, isa linear manifold in 2%. If there is a positive linear functional 
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g: £, > R that extends f, let f be any linear functional on % that extends 
g (use a Hamel basis). If x > 0, then x € PC 2, so that f(x) = g(x) = 0. 
Hence f is positive. Thus, we may assume that = Y+ P — P. 


9.9. Claim. 7= Y¥+ P=Y-P. 


LetxE€2%;sox=ytp,—p., ymY, py, p, in P. Since ¥ is cofinal 
there is a y, in Y such that y, > p,. Hence p, = y, —(y, —-p,) © Y— P. 
Thus. = y-= py + pp, (F=f) + (4 =2) CY =f. S80 2 = Ve Pf. 
Also, = -—-#= -Y+P=Y+P. 


9.10. Claim. If x © 2, there are y,, y, in Y such that y, < x < yy. 


In fact, Claim 9.9 states that we can write x = y, — Py =}. + Po, Pi Po 
& P and )y,, y, © Y. Thus y» < x < yy. 
By Claim 9.10, it is possible to define for each x in 2, 


q(x) = inf{ f(y): y © Yand y = x}. 
9.11. Claim. The function g is a sublinear functional on 2. 
The proof of (9.11) is left as an exercise. 


For y in &, let y, € Y such that y, = y. Because f is positive, f(y) < 
f(y). Hence f(y) < q(y) for all y in Y. The Hahn—Banach Theorem 
implies that there is a linear functional f: #— R such that f/¥=f and 
f<qon &. If x © P, then —x < 0 (and 0 € Y). Hence g(-x) < f(0). 
Thus — f(x) = f(—x) < q(—-x) < 0, or f(x) > Q. Therefore if IS positive. 

al 


9.12. Corollary. Let (%, <) be an ordered vector space with an order unit e. 
If Y is a linear manifold in and e € Y, then any positive linear functional 
defined on Y has an extension to a positive linear functional defined on &. 


EXERCISES 
1. Prove Proposition 9.3. 
2. Prove Proposition 9.4. 


3. Show that e is an order unit for (2, <) if and only if for every x in 2 there isa 
6 > 0Osuch thate+x>0for0<1r<6. 


4. Show that C(IR), the space of all continuous real-valued functions on R, has no 
order unit. 


5. Prove (9.11). 


6. Characterize the order units of C,(X). Does C,(X) always have an order unit? 
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7. Characterize the order units of C,(X) if X is locally compact. Does C)(X) 
always have an order unit? 


8. Let = M,(IR), the 2 X 2 matrices over R. Define A in M,(R) to be positive if 
A = A* and (Ax, x) > 0 for all x in R*. Characterize the order units of M,(R). 


9. If 1<p< oo and = L?(0,1), define f < g to mean that f(x) < g(x) ae. 
Show that & is an ordered vector space that has no order unit. 


§10. The Dual of a Quotient Space and a Subspace 


Let & be a normed space and 4 < 2%. lf fe %*, then f|.4, the restriction 
of f to , belongs to #* and ||f|.4| < |lf\l. According to the 
Hahn-Banach Theorem, every bounded linear functional on -4 1s obtain- 
able as the restriction of a functional from % *. In fact, more can be said. 

Note that if Z* = {g © #*: g() = 0} (note the analogy with Hilbert 
space notation); then .#+ is a closed subspace of the Banach space #*. 
Hence &*/@~* is a Banach space. Moreover, if f+ #*< #*/M~, then 
f+.@~ induces a linear functional on .4, namely f|.Z. 


10.1. Theorem. Jf 4< 2% and M~={gE#*: g(M)=0}, then the 
map p: &*/M~* —> M* defined by 


p(ft+a*)=f\|a 


is an isometric isomorphism. 


PROOF. It is easy to see that p is linear and injective. If fe %* and 
ge M~-, then |{f|Z|| = |l(f+ g)|Z4|I| < |lf + g||. Taking the infimum over 
all g we get that ||f|.Z|| < ||f +4 ~||. Suppose ¢ € .@*. The Hahn—Banach 
Theorem implies that there is an f in 2%* such that f|4@=o9 and 
Ill = ||$|]. Hence ¢ = p(f +. @~) and |[¢|| = If] = Ilf + 4. 


Now consider 2/4; what is (2/7)*? Let O: ¥— 2/M be the natural 
map. If fe (%/@)*, then foQO EB* and |i/f°Q|| < ||f\|. (Why?) This 
gives a way of mapping (2/7)* — 2@*. What is its image? Is it an 
isometry? 


10.2. Theorem. Jf 4 < % and Q: £->2/M is the natural map, then 
e(f) =f °Q defines an isometric isomorphism of (&/M)* onto M~. 


Proor. If f€ (%/@)* and y €.4, then foQ(y)=0, so fe-QOe 4". 
Again, it is easy to see that p: (2/W)* —.@~ is linear and, as was seen 
earlier, ||o(/)|| < |[f||. Let {x, +@} be a sequence in 2/Z@ such that 
|x, + 4] < 1 and |f(x, +-Z)| — |||. For each » there isa y, in @ such 


that ||x, + y,|| < 1. Thus ||o(f)|| = lo( f(x, + 1 = Px, +) > IVA, 
SO p iS an isometry. 
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To see that p is surjective, let g€.W%~; then g © 2%* and g(.Z) = 0. 
Define f: 4/4 — F by f(x + ZW) = g(x). Because g(.W) = 0, f is well 
defined. Also, if x € @ and y € 4%, |f(x + @)| = |g(x)| = |g(x + y)| < 
lg|| || + y||. Taking the infimum over all y gives |f(x +.Z)| < ||g|| ||x + 
M\|. Hence f © (2/M)*, p(f) = g, and |If| < llo(/)|l. 


§11. Reflexive Spaces 


If Z is a normed space, then we have seen that 2 * is a Banach space (5.4). 
Because #* is a Banach space, it too has a dual space (2 *)* = 2** and 
4 ** is a Banach space. Hence 2** has a dual. Can this be kept up? 
Before answering this question, let’s examine a curious phenomenon. If 
x © #, then x defines an element x of 2 **; namely, define *: 2 * — F by 


11.1 R(x*) = x*(x) 


for every x* in &*. Note that Corollary 6.7 implies that ||xX|| = ||x|| for all 
x in 2. The map x > x of > #** is called the natural map of % into 
its second dual. 


11.2. Definition. A normed space 2% is reflexive if @** = {X: x EX}, 
where xX is defined in (11.1). 

First note that a reflexive space 2 is isometrically isomorphic to 2 **, 
and hence must be a Banach space. It is not true, however, that a Banach 
space & that is isometric to %** is reflexive. The definition of reflexivity 
stipulates that the isometry be the natural embedding of 2% into 2**. In 
fact, James [1951] gives an example of a nonreflexive space 2 that is 
isometric to 2 **. 


11.3. Example. If 1 < p < o, L?(X,Q, 1) is reflexive. 


11.4. Example. c, is not reflexive. We know that cf = /’, so c&* = (1')* 
= /~. With these identifications, the natural map c, — c** is precisely the 
inclusion map Cy > /*. 

A discussion of reflexivity is best pursued after the weak topology is 
understood (Chapter V). Until that time, we will say adieu to reflexivity. 


EXERCISES 
1. Show that (%*)** and (2 **)* are isometrically isomorphic. 
2. Show that C,(X) is reflexive if and only if X is finite. 


3. Let “4 < & and let pz: ¥> X** and py: W—> M** be the natural maps. If 
i: M — & is the inclusion map, show that there is an isometry @: “@** > #** 
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such that the diagram 


gy Par Ox 


if le 
M ———> M** 
Pa 


commutes. Prove that ¢(4**) = (@~)* = {x** © B**: x**(M~) = 0}. 


4. Use Exercise 3 to show that if 2 is reflexive, then any closed subspace of 2 is 
also reflexive. 


§12. The Open Mapping and Closed Graph Theorems 


12.1. The Open Mapping Theorem. If 2,% are Banach spaces and A: 
X — Y is a continuous linear surjection, then A(G) is open in Y whenever G 
is open in &. 


PROOF. For r > 0, let B(r) = {x € &: ||x|| <r}. 
12.2. Claim. 0 € intcl A( B(r)). 


Note that because A is surjective, Y = U%_,cl[A(B(kr/2))] = 
US_,kcl{ A( B(r/2))]. By the Baire Category Theorem, there is a k > 1 such 
that kcl[A(B(r/2))] has nonempty interior. Thus V = int{cl[ A( B(r/2))]} 
#0. If ye V, let s>O0 such that {yeEe®@: |ly-yl]l<s} CVC 
cl A(B(r/2)). Let yEY, |ly|| < s. Since y, € cl A(B(r/2)), there is a 
sequence {x,} in B(r/2) such that A(x,,) > yo. There is also a sequence 
{z,} in B(r/2) such that A(z,) > y+ y. Thus A(z, —x,)— y and 
{Z, —x,}C B(r); that is, {y © &: |ly|| <s} Cc cl A(B(r)). This estab- 
lishes Claim 12.2. 

It will now be shown that 


12.3 cl A( B(r/2)) C A(B(r)). 


Note that if (12.3) is proved, then Claim 12.2 implies that 0 € int A(B(r)) 
for any r > 0. From here the theorem is easily proved. Indeed, if G is an 
open subset of %, then for every x in G let r, > 0 such that B(x; r,) C G. 
But O € int A(B(r,)) and so A(x) € int A( B(x; 7r,)). Thus there is an 
s, > 0 such that U.={ye¥Y%: |ly — A(x)|| < s,} C AC B(x; r,)). There- 
fore A(G) > U{U,: x © G}. But A(x) € U,, so A(G) = U{U,: x © G} and 
hence A(G) is open. 

To prove (12.3), fix y, in cl A( B(r/2)). By (12.2), 0 € int[cl A( B(2~r))]. 
Hence [y, — cl A(B(2>?r))] N A(B(r/2)) # O. Let x, © B(r/2) such that 
A(x,) © [y, — el A(B(2~*r))]; now A(x,) = y, — y2, where y, € 
cl A( B(2~’r)). Using induction, we obtain a sequence {x,} in @ and a 
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sequence { y, } in Y such that 


(i) x, € B(2~"r), 
12.4 § (ii) y, € cl A(B(2~"r)), 
(iii) Yn+1 ~ Yn AL.) 


But ||x,|| < 27"r, so LP||x,|| < oo; hence x = Lx, exists in & and 
|x|] <r. Also, 
e A(x,) = > (Ye — Vari) = V1 — nat 
k=1 k=1 
But (12.411) implies |ly,|| < ||A||2~’7; hence y, > 0. Therefore y, = 
Le_, A(x,) = A(x) € A(B(r)), proving (12.3) and completing the proof of 
the theorem. = 


The Open Mapping Theorem has several applications. Here are two 
important ones. 


12.5. The Inverse Mapping Theorem. /f 2 and Y are Banach spaces and A: 
%-— Y is a bounded linear transformation that is bijective, then A~' is 
bounded. 


Proor. Because A is continuous, bijective, and open by Theorem 12.1, A is 
a homeomorphism. ea 


12.6. The Closed Graph Theorem. Jf 2 and Y are Banach spaces and A: 
& — Y is a linear transformation such that the graph of A, 


grad={x@®@AxeE®OY:xEX} 
is closed, then A is continuous. 


Proor. Let Y= gra A. Since %6,¥Y is a Banach space and § is closed, 
is a Banach space. Define P: ¥ > & by P(x ® Ax) = x. It is easy to check 
that P is bounded and bijective. (Do it). By the Inverse Mapping Theorem, 
P~!: €- @ is continuous. Thus A: 2—> Y is the composition of the 
continuous map P~': > G and the continuous map of 9 > ¥ defined by 
x ® Ax ~ Ax; A 1s therefore continuous. ra 


Let 2= all functions f: [0,1] — F such that the derivative f’ exists and 
is continuous on [0,1]. Let Y= C[0,1] and give both 2 and ¥Y the 
supremum norm: ||f]| = sup{|f(t)|: ¢ © [0,1]}. So % is not a Banach space, 
though ¥Y is. Define A: 2— Y by Af = f’. Clearly, A is linear. If {f,} Cc & 
and (f,, f,) > (/, g) in 2X Y, then f,’ > g uniformly on [0, 1]. Hence 


f,(t) — f,(0) = ffi(s) ds > f's(s) ds. 
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But /,(¢) — f,(0) > f@) — £O), so 
f(t) = £(0) + f(s) ds. 
0 


Thus f’ = g and gra A is closed. However, A is not bounded. (Why?) 

The preceding example shows that the domain of the operator in the 
Closed Graph Theorem must be assumed to be complete. The next example 
(due to Alp Eden) shows that the range must also be assumed to be 
complete. 

Let & be a separable infinite-dimensional Banach space and let {e;: 
i = I} be a Hamel basis for @ with |le,|| = 1 for all i. Note that a Baire 
Category argument shows that J is uncountable. If x € 2, then x = La, e,, 
a, € F, and a, = 0 for all but a finite number of i in J. Define ||x||, = &,\a,]. 
It is left as an exercise for the reader to show that || - ||, 1s a norm on 2. 
Since |le,|| = 1 for all i, ||x|| < X,Ja,| = ||x||,. Let Y= 2% with the norm 
| - ||, and let T: Y> Z be defined by 7(x) = x. Note that it was just 
shown that T~': #— Y is a contraction. Therefore gra T~' is closed and 
hence so is graT. But T is not continuous because if it were, then JT would 
be a homeomorphism. Since 2% is separable, it would follow that Y is 
separable. But ¥ is not separable. To see this, note that |e; — e,||, = 2 for 
i # j and since J is uncountable, Y cannot be separable. 

When applying the Closed Graph Theorem, the following result is useful. 


12.7. Proposition. Jf 2 and Y are normed spaces and A: 4— ¥ is a linear 
transformation, then gra A is closed if and only if whenever x, 0 and 
Ax, — y, it must be that y = 0. 


PROOF. Exercise 3. 


Note that (12.7) underlines the advantage of the Closed Graph Theorem. 
To show that A is continuous, it suffices to show that if x, — 0, then 
Ax, > 0. By (12.7) this is eased by allowing us to assume that { Ax, } is 
convergent. 

It is possible to give a measure-theoretic solution to Exercise 2.3, but here 
is one using the Closed Graph Theorem. Let (X, 2, w) be a o-finite measure 
space, 1 < p < oo, and ¢: X —F an {2-measurable function such that 
of <= L?(w) whenever f & L?(p). Define A: L?(n) > L?(p) by Af = Of. 
Thus A is linear and well defined. Suppose f, - 0 and $f, > g in L?(p). 
If 1 < p < o, then f, ~ 0 in measure. By a theorem of Riesz, there is a 
subsequence { f,, } such that f, (x) > 0 ae. [u]. Hence $(x)f, (x) > 0 ae. 
[uw]. This implies g = 0 and so gra A is closed. If p = o0, then f(x) > 0 
a.e. [u] and the same argument implies gra A is closed. By the Closed Graph 
Theorem, A is bounded. Clearly, it may be assumed that ||A|| = 1. If 6 > 0, 
let E = (x: |(x)| = 1+ 6}. Now ||A"|| < 1, so |/$"fll, < IIAl, for all 
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n> 1. Thus 


UA, = florrrn’ du = (1+ 8)" f IPP de. 


But (1 + 6)”? > oo as n > oo. Hence f,|f|? du = 0 for each f in L?(p), 
and p(£) = 0. It follows that ¢ € L®(p) and |¢| < 1 ae. [p]. 


12.8. Definition. If %,% are Banach spaces, an isomorphism of & and Y 
is a linear bijection T: 2 — Y that is a homeomorphism. Say that & and Y 
are isomorphic if there is an isomorphism of 2% onto &. 


Note that the Inverse Mapping Theorem says that an isomorphism is a 
continuous bijection. 

The use of the word “isomorphism” is counter to the spirit of category 
theory, but it is traditional in Banach space theory. 


EXERCISES 


1. Suppose % and ¥& are Banach spaces. If A © @(#,¥Y) and ran A is a second 
category space, show that ran A 1s closed. 


2. Give both C [0,1] and C[0,1] the supremum norm. If A: C"[0,1] > C[0,1] is 
defined by Af = f’, show that A is not bounded. 


3. Prove Proposition 12.7. 


4. Let & be a vector space and suppose || - ||, and || - ||, are two norms on 2 and 
that 4, and ¥%, are the corresponding topologies. Show that if 2 is complete in 
both norms and ¥, D2 4%, then 4, = 4. 


5. Let # and Y be Banach spaces and let A € @(%,¥Y). Show that there is a 
constant c > 0 such that ||Ax|| => c||x|| for all x in & if and only if kerA = (0) 
and ran A is closed. 


6. Let X be compact and suppose that % is a Banach subspace of C(X). If E isa 
closed subset of X such that for every g in C(£) there is an f in 2 with 
f|E = g, show that there is a constant c > 0 such that for each g in C( F£) there is 
an f in & with f|/E = g and max{|f(x)|: x © X} < cmax{|g(x)|: x © FE}. 


7. Let 1 < p < o and suppose (a;,;) is a matrix such that (Af)(i) = 27 \4;,(/) 
defines an element Af of /? for every f in /?. Show that A € &(/”). 


8. Let (X,2,p) be a o-finite measure space, 1 < p < oo, and suppose that k: 
X X X > F is an Q X Q measurable function such that for f in L’(p) and a.e. 
x, k(x, -)f(-) € L'(p) and (Kf)(x) = [k(x, y) f(y) du(y) defines an element 
Kf of L?(). Show that K: L?(p) > L?(p) is a bounded operator. 
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§13. Complemented Subspaces of a Banach Space 


If 2 is a Banach space and 4 < &, say that -@ is algebraically comple- 
mented in & if thereisan W< 2% suchthat 470 W= (O)and 4+ WV= Z. 
Of course, the definition makes sense in a purely algebraic setting, so the 
requirement that 4 and W be closed seems fatuous. Why is it made? 

If @ is a linear manifold in a vector space 2 (a Banach space or not), 
then a Hamel-basis argument can be fashioned to produce a linear manifold 
MN such that 4AW= (0) and 4+ W=2Z. So the requirement in the 
definition that .@ and W be closed subspaces of the Banach space % 
makes the existence problem more interesting. Also, since we are dealing 
with the category of Banach spaces, all definitions should involve only 
objects in that category. 

If @ and W are algebraically complemented closed subspaces of a 
normed space 2%, then A: 40,W-— & defined by A(m © n)=m+nisa 
linear bijection. Also, ||A(m ® n)|| = ||m + nl < ||m|| + |ln|| = ||m @ nll. 
Hence A is bounded. Say that 4 and Y are topologically complemented if 
A is a homeomorphism; equivalently, if ||| + n||| = |\m|| + ||n|| is an 
equivalent norm. If 2 is a Banach space, then the Inverse Mapping 
Theorem implies A is a homeomorphism. This proves the following. 


13.1. Theorem. If a pair of subspaces of a Banach space are algebraically 
complementary, then they are topologically complementary. 


This permits us to speak of complementary subspaces of a Banach space 
without modifying the term. The proof of the next result is left to the reader. 


13.2. Theorem. (a) Jf 4 and NW are complementary subspaces of a Banach 
space & and E: 4— is defined by E(m +n) =m for min M and n in 
MN, then E is a continuous linear operator such that E* = E, ran E = ™, and 
kerE = W.(b) IfE © &(&) and E? = E, then M = ran Eand NW = kerE 
are complemented subspaces of &. 


If @<2 and @ is complemented in %, its complementary subspace 
may not be unique. Indeed, finite-dimensional spaces furnish the necessary 
examples. 

A result due to R. S. Phillips [1940] is that cy is not complemented in /%. 
A straightforward proof of this can be found in Whitely [1966]. Murray 
[1937] showed that /?, p # 2, p> 1 has uncomplemented subspaces. This 
seems to be the first paper to exhibit uncomplemented subspaces of a 
Banach space. 

Lindenstrauss [1967] showed that if .@ is an infinite-dimensional sub- 
space of /© that is complemented in /”, then -@ is isomorphic to /”. This 
same result holds if /© is replaced by /?, 1 < p < 00, ¢, or Cp. 
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Does there exist a Banach space 2% such that every closed subspace of % 
is complemented? Of course, if 2 is a Hilbert space, then this is true. But 
are there any Banach spaces that have this property and are not Hilbert 
spaces? Lindenstrauss [1971] proved that if 2 is a Banach space and every 
subspace of % is complemented, then 2% is isomorphic to a Hilbert space. 


EXERCISES 


1. If & is a vector space and -@ is a linear manifold in 2, show that there is a 
linear manifold / in & such that 4 W= (0) and 4+ WV=2. 


2. Let & be a Banach space and let E: > ¥ be a linear map such that E* = E 
and both ran E and ker EF are closed. Show that EF is continuous. 


3. Prove Theorem 13.2. 


4. Let & be a Banach space and show that if / is a complemented subspace of 2, 
then every complementary subspace is isomorphic to 2/4. 


5. Let X be a compact set and let Y be a closed subset of X. A simultaneous 
extension for Y is a bounded linear map T: C(Y) — C(X) such that for each g 
in C(Y), T(g)|Y = g. Let GQ(X\ Y) = {f © CCX): f(y) = 0 for all y in Y}. 
Show that if there is a simultaneous extension for Y, then C,( X \ Y) is comple- 
mented in C(X). 


6. Show that if Y is a closed subset of [0,1], then there is a simultaneous extension 
for Y (see Exercise 5). (Hint: Write [0,1]\\ Y as the union of disjoint intervals.) 


7. Using the notation of Exercise 5, show that if Y is a retract of X, then 
Cy(X \ Y) is complemented in C(X). 


§14. The Principle of Uniform Boundedness 


There are several results that may be called the Principle of Uniform 
Boundedness (PUB) and all of these are called the PUB by various 
mathematicians. In this book the PUB will refer to any of the results of this 
section, though in a formal way the next result plays the role of the founder 
of the family. 


14.1. Principle of Uniform Boundedness (PUB). Let 2% be a Banach space 
and Y a normed space. If ZC B(L,Y) such that for each x in 2, 
sup{||Ax||: 4 € 2} < 0, then sup{||A||: 4 € 7} < oo. 


PROOF. (Due to William R. Zame) For each x in 2 let M(x) = sup{||Ax||: 
A € ¥\, 80 ||Ax|| < M(x) for all x in &. Suppose sup{||A||: 4 € a} = oo. 
Then there is a sequence { A, } C Ww and a sequence {x,,} of vectors in 
such that ||x,|| = 1 and ||A,x,|| > 4”. Let y, = 2° "x,,; thus ||y,|| = 27” and 
NA,,Ynll > 2”. 


n? 
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14.2. Claim. There is a subsequence { y, } such that for A > 1: 


(a) l14q,. Yn, qll > 1+ k + DL M(y, )s 
(b) [lyq,,,l1 < 2-*Msup{ll4, I 1 <7 < ky. 


The proof of (14.2) is by induction. Let n, = 1. The induction step is 
valid since ||y,|| — 0 and ||A,,y,|| ~ oo. The details are left to the reader. 

Since L,||y,,,|| < 0, 2, %n, = y in ¥ (here is where the completeness of % 
is used). Now for any k > 1, 


k fore) 


Ang In, a Ang Inga, a pe Anes, 
1 J=uk+2 


Aye VN = 


= 


k 00 
An Sree a lie ze Aang Dn, i 2 Any In, 
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J 
k ore) 

os A vioics Voi ae » Ann, 
y= jere2 


k k 00 
>l+k+ M34) =| B Ms,) 4 Dee ra 
j=l jak+2 


> || A 


nese 


>k. 
That is, M(y) = k for all k, a contradiction. es 


14.3. Corollary. If Z is a normed space and A © &, then A is a bounded set 
if and only if for every f in &*, sup{|f(a)|: a € A} < 00. 


PROOF. Consider % as a subset of @(2*,F) (= &**) by letting x(f) = 
f(x) for every f in #*. Since 2* is a Banach space and ||x|| = ||X|| for all 
x, the corollary is a special case of the PUB. EY 


14.4. Corollary. Jf % is a Banach space and A C &*, then A is a bounded 
set if and only if for every x in 2, sup{|f(x)|: f © A} < oo. 


PROOF. Consider 2* as @(2,F). fa 


Using Corollary 14.3, it is possible to prove the following improvement of 
(14.1). 


14.5. Corollary. If 2 is a Banach space and Y is a normed space and if 
AC Bi X,Y) such that for every x in & and g in Y*, 


sup{|g(A(x))|: 4 ©} < 0, 
then sup{||A||: A € 2} < o. 
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PROOF. Fix x in %. By the hypothesis and Corollary 14.3, sup{||A(x)||: 
A € XL} < ow. By (14.1), sup{||Al|: A EW} <0. ff 


A special form of the PUB that is quite useful is the following. 


14.6. The Banach-Steinhaus Theorem. /f 2 and ¥Y are Banach spaces and 
{A,,} is a sequence in B(2,Y) with the property that for every x in & there 
isa yin Y such that \|A,,.x — y|| > 0, then there is an A in B( X,Y) such 
that ||A,x — Ax|| > 0 for every x in & and sup||A,,|| < 00. 


Proor. If x € #, let Ax = lim A,x. By hypothesis A: #— Y is defined 
and it is easy to see that it is linear. To show that A is bounded, note that 
the PUB implies that there is a constant M > 0 such that ||A,|| < © for all 
n. If x © & and ||x|| <1, then for any n> 1, ||Ax|| < ||Ax — A,x|| + 
||A,,x|| < ||4x — A,x|| + M. Letting n— o shows that |/Ax|| < M 
whenever ||x|| < 1. cs 


The Banach-—Steinhaus Theorem is a result about sequences, not nets. 
Note that if 7 is the identity operator on % and for eachn >1, A, =n‘ 
and for n < 0, A, = nl, then { A,: n © Z} is a countable net that converges 
in norm to QO, but the net is not bounded. 


14.7. Proposition. Let X be locally compact and let { f,,\ be a sequence in 
C)(X). Then ff, dp — [fdu for every p in M(X) if and only if sup,]||f,,|| < °c 
and f, (x) > f(x) for every x in X. 


PROOF. Suppose {f,du — {fd for every w» in M(X). Since M(X)= 
C,( X)*, (14.3) implies that sup,]||f,|| < oo. By letting uy = 6,, the unit point 
mass at x, we see that {f, dd, = f,(x) > f(x). The converse follows by the 
Lebesgue Dominated Convergence Theorem. is 


EXERCISES 


1. Here is another proof of the PUB using the Baire Category Theorem. With the 
notation of (14.1), let B, = {x © %: ||Ax|| <n for all A in 7}. By hypothesis, 
Us_,B, = &. Now apply the Baire Category Theorem. 


2. If 1<p< oo and {x,}C/’, then L%,x,(/)y(J) > O for every y in /%, 
1/p + 1/q = 1, if and only if sup,||x,||, < co and x,(j) > 0 for every j > 1. 


3. If {x,} CV, then ©°,x,(/)y(y) 2 0 for every y in co if and only if 
sup,,||x,||, < co and x,(j) > 0 for every j > 1. 


4. If (X,Q,p) is a measure space, 1 < p < o, and {f,} C L?(X,0,p), then 


{fgdu — 0 for every g in Li(u), 1/p + 1/q = 1, if and only if sup{|l/,|l,: 
n > 1} < oo and for every set E in 2 with p(E) < 0, fyf,du—> 0asn- oo. 


5. If (X,2,p) is a o-finite measure space and { f,} is a sequence in L'( X, 2,1), 
then {f,gdpu — 0 for every g in L*(p) if and only if sup{|[f,||,;: 7 = 1} < oc 
and /,f, du — 0 for every E in 22. 
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6. 


10. 


Let # be a Hilbert space and let & be an orthonormal basis for #. Show that 
a sequence {h,,} in # satisfies (h,,h) — 0 for every h in 3% if and only if 
sup{||H,,||: 2 = 1} < oo and (h,,e) — 0 for every e in @. 


. If X is locally compact and {p,,} is a sequence in M(X), then L(p,) > 0 for 


every L in M(X)* if and only if sup{||p,,||: 2 = 1} < oo and p,(F) > 0 for 
every Borel set E. 


_ In (14.6), show that ||A|| < lim sup]]A, ||. 


. If (S,d) is a metric space and % is a normed space, say that a function 


f: S—>& 1s a Lipschitz function if there is a constant M > 0 such that 
lIf(x) — f(t)|| < Md(s,t) for all s,¢ in S. Show that if f: S —~ % is a function 
such that for all L in #*, Lof: S > F is Lipschitz, then f: S ~&% isa 
Lipschitz function. 


Let 2 be a Banach space and suppose { x,,} is a sequence in % that is linearly 
independent and such that for each x in % there are scalars {a,,} such that 
lim,,_. {|x — L%-,a,x;,|| = 0. Such a sequence is called a basis. (a) Prove that 
# is separable. (b) Let Y= {{a,} © FX: L%_,a,x, converges in #} and for 
y = (a, } in Y define ||y|| = sup, |[L7_,a,x,||. Show that Y is a Banach space. 
(c) Show that there is a bounded biection T: #— Y. (d) If n> 1 and f:: 
X— F is defined by f,(L%_,a,x,) = a,, show that f, © #*. (e) Show that 
x, € the closed linear span of {x,: k # n}. 


CHAPTER IV 


Locally Convex Spaces 


A topological vector space is a generalization of the concept of a Banach 
space. The locally convex spaces are encountered repeatedly when discuss- 
ing weak topologies on a Banach space, sets of operators on Hilbert space, 
or the theory of distributions. This book will only skim the surface of this 
theory, but it will treat locally convex spaces in sufficient detail as to enable 
the reader to understand the use of these spaces in the three areas of 
analysis just mentioned. For more details on this theory, see Bourbaki 
[1967], Robertson and Robertson [1966], or Schaefer [1971]. 


§1. Elementary Properties and Examples 


A topological vector space is a vector space that is also a topological space 
such that the linear structure and the topological structure are vitally 
connected. 


1.1. Definition. A topological vector space (TVS) is a vector space & 
together with a topology such that with respect to this topology 
(a) the map of 2 xX &-— 4 defined by (x, y) > x + y is continuous; 
(b) the map of F xk 2-— & defined by (a, x) +> ax is continuous. 
It is easy to see that a normed space is a TVS (Proposition II.1.3). 


Suppose # is a vector space and F is a family of seminorms on #. Let 7 
be the topology on 2% that has as a subbase the sets {x: p(x — x9) < €}, 
where p © Y, x, © X, and e > 0. Thus a subset U of Z is open if and only 
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if for every x, in U there are p,,..., p, mn FY and «,,...,¢€, > 0 such that 
Nui{x ©: p(x — x9) <e,} © U. It is not difficult to show that 2 with 
this topology is a TVS (Exercise 2). 


1.2. Definition. A locally convex space (LCS) is a TVS whose topology is 
defined by a family of seminorms ¥ such that, <a{x: p(x) = 0} = (0). 


The attitude that has been adopted in this book is that all topological 
spaces are Hausdorff. The condition in Definition 1.2 that), -o{x: p(x) 
= 0} = (Q) is imposed precisely so that the topology defined by FY be 
Hausdorff. In fact, suppose that x # y. Then there is a p in F such that 
p(x — y) #0; let p(x —y) > e > 0. If U= {z: p(x — z) < Se} and V = 
{z: p(y — z) < 3e}, then UN V = Cand U and V are neighborhoods of x 
and y, respectively. 

If % isa TVS and x, € &, then x > x + x, is a homeomorphism of 2; 
also, if a © F and a #0, x ~ ax is a homeomorphism of 2 (Exercise 4). 
Thus the topology of 2% looks the same at any point. This might make the 
next statement less surprising. 


1.3. Proposition. Let Z be a TVS and let p be a seminorm on 2. The 
following statements are equivalent. 


(a) p is continuous. 

(b) {x © £: p(x) < 1} is open. 

(c) 0 € int{x © Z: p(x) < 1}. 

(d) 0 € int{x © 2: p(x) < 1}. 

(€) p is continuous at 0. 

(f) There is a continuous seminorm q on & such that p < q. 


PROOF. It is clear that (a) > (b) = (c) = (d). 

(d) implies (e): Clearly (d) implies that for every e > 0, 0 € int{x © Z: 
P(x) < €}; soif {x,} is a net in 2 that converges to 0 and e > 0, there is 
an i, such that x, © {x: p(x) < €} fori > ip; that is, p(x;) < e fori = ig. 
So p is continuous at 0. 

(e) implies (a): If x; > x, then |p(x) — p(x,)| < p(x — x;). Since x — x, 
— 0, (e) implies that p(x — x,;) ~ 0. Hence p(x;) > p(x). 

Clearly (a) implies (f). So it remains to show that (f) implies (e). If x; > 0 
in 2, then g(x,) > 0. But 0 < p(x;) < q(x;), so p(x;) > 9. a 


1.4. Proposition. If Z is a TVS and p,,..., p, are continuous seminorms, 
then p, + +--+ +p, and max,( p,(x)) are continuous seminorms. If { p;} is a 
family of continuous seminorms such that there is a continuous seminorm q 
with p; < q for all i, then x > sup,{ p,(x)} defines a continuous seminorm. 
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PROOF. Exercise. 


If P is a family of seminorms on 2% that makes % into a LCS, it is often 
convenient to enlarge FY by assuming that F is closed under the formation 
of finite sums and supremums of bounded families [as in (1.4)]. Sometimes it 
iS convenient to assume that F consists of all continuous seminorms. In 
either case the resulting topology on 2 remains unchanged. 


1.5. Example. Let X be completely regular and let C(X) = all continuous 
functions from X into F. If K is a compact subset of X, define p,(f) = 
sup{|f(x)|: x © K }. Then { p,: K compact in X} is a family of semi- 
norms that makes C(.X) into a LCS. 


1.6. Example. Let G be an open subset of C and let H(G) be the subset of 
C.(G) consisting of all analytic functions on G. Define the seminorms of 
(1.5) on A(G). Then H(G) is a LCS. Also, the topology defined on H(G) 
by these seminorms is the topology of uniform convergence on compact 
subsets—the usual topology for discussing analytic functions. 


1.7. Example. Let 2 be a normed space. For each x* in 2%, define 
Py»(X) = |x*(x)|. Then p,. is a seminorm and if P= { pws x* © 4*\, P 
makes % into a LCS. The topology defined on % by these seminorms is 
called the weak topology and is often denoted by o( 2%, %*). 


1.8. Example. Let 2 be a normed space and for each x in % define p,: 
#* — [0, 00) by p,(x*) = |x*(x)|. Then p, is a seminorm and Y= { p,: 
x € £} makes 2* into a LCS. The topology defined by these seminorms is 
called the weak-star (or weak* or wk*) topology on #*. It is often denoted 
by 0( %*, &). 


The spaces 2 with its weak topology and %* with its weak* topology are 
very important and will be explored in depth in Chapter V. 

Recall the definition of convex set from (1.2.4). If a,b € 2, then the line 
segment from a to b is defined as [a, b] = {th + (1 —t)a:0<t<1}.Soa 
set A is convex if and only if [a,b] C A whenever a,b € A. The proof of 
the next result is left to the reader. 


1.9. Proposition. (a) A set A is convex if and only if whenever x,,...,X, © A 
and t,,...,t,, © [0,1] with Ut; = 1, then L t,x, € A.(b) If {Aj 1 I} isa 


collection of convex sets, then ( ,A, is convex. 


1.10. Definition. If A C %, the convex hull of A, denoted by co( A), is the 
intersection of all convex sets that contain A. If ¥ is a TVS, then the closed 
convex hull of A is the intersection of all closed convex subsets of % that 
contain A; it is denoted by co( A). 
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Since a vector space is itself convex, each subset of % is contained in a 
convex set. This fact and Proposition 1.9(b) imply that co( A) is well defined 
and convex. Also, co(A) is a closed convex set. 

If 2 is a normed space, then { x: ||x|| <1} and {x: ||x|] <1} are both 
convex sets. If fe 2*, {x: |f(x)| < 1}, (x: Re f(x) < 1}, {x: Ref(x) > 1} 
are all convex. In fact, if 7: 2— Y is a real linear map and C is a convex 
subset of Y, then T~1(C) is convex in &. 


1.11. Proposition. Let & be a TVS and let A be a convex subset of &. Then 
(a) cl A is convex; (b) ifa € int A and b € cl A, then [a,b) = {th + (1 - 
tha;0<t<1} Cinta. 


PROOF. Let a © A, bE clA, and 0 <t <1. Let {x,} be a net in A such 
that x, > 6. Then tx, + (1 — t)a > tb + (1 — t)a. This shows that 


1.12 b incl A and a in A imply [a,b] C cl A. 


Using (1.12) it is easy to show that cl A is convex. To prove (b), fix ¢, 
0 <¢<1,and put c = tbh + (1 — t)a, where a € int A and b € cl A. There 
is an open set V in 2 such that 0 € V and a+ VC A. (Why?) Hence for 
any d in A 


A 


lU 


td+(1-—t)(a+V) 
t(d-—b)+th+(1-t)(at+V) 
[t(d-b)+(1-t)V] +c. 


If it can be shown that there is an element d in A such that 0 € t(d — b) + 
(1 — t)V = U, then the preceding inclusion shows that c € int A since U is 
open (Exercise 4). Note that the finding of such a d in A is equivalent to 
finding a d such thatO © t 311 —-t1)V + (d-—b)ordeb—t ‘1 — wy. 
But 0 © —r 4(1 — £)V and this set is open. Since b € cl A, d can be found 
in A. a 


1.13. Corollary. If A C %, then co( A) is the closure of co( A). 


A set A C & is balanced if ax © A whenever x € A and |a| < 1. A set 
A is absorbing if for each x in @ there is an e > O such that tx € A for 
0 < t<e. Note that an absorbing set must contain the origin. If a € A, 
then A is absorbing at a if the set A — a is absorbing. Equivalently, A is 
absorbing at a if for every x in 2 there is an e > 0 such thata + tx EG A 
forO <t<e. 

If % is a vector space and p is a seminorm, then V = {x: p(x) < l}isa 
convex balanced set that is absorbing at each of its points. It is rather 
remarkable that the converse of this is true. This fact will be used to give an 
abstract formulation of a LCS and also to explore some geometric conse- 
quences of the Hahn—Banach Theorem. 
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1.14. Proposition. Jf % is a vector space over F and V is a convex, balanced 
set that is absorbing at each of its points, then there is a unique seminorm p on 
¥ such that V = {x © &: p(x) < 1}. 


PROOF. Define p(x) by 

p(x) =inf{t:¢>Oandx erV}. 
Since V is absorbing, 4 = U%_,nV, so that the set whose infimum is p(x) is 
nonempty. Clearly p(0) = 0. To see that p(ax) = |alp(x), we can suppose 
that a # 0. Hence, because V is balanced, 


p(ax) = inf{t>0:axetV} 


= inf{ 1 > 0: reds v)| 
inf|¢ > 0: ve -v)| 
= lang oy nid 


= |alp(x). 
To complete the proof that p is a seminorm, note that if a, 6 > 0 and 
a,b € V, then 


aa + Ab = (a+ B)( —* +4 + Eble (a+ BV 


by the convexity of V. If x,y © %, p(x) =a, and p(y)= B, let 6 > 0. 
Then x € (a+ 5)V and y €(B + 5)V. (Why?) Hence x + y €(at+ dV 
+(B+65V=(a+B8+26)V (Exercise 11). Letting 6 > 0 shows that 
p(x+y)<sa+t B= p(x) + p(y). 

It remains to show that V = { p(x) < 1}. If p(x) =a <1, thena < B 
< 1 implies x € BV C V since V is balanced. Thus V 3 { x: p(x) < 1}. If 
x € V, then p(x) < 1. Since V is absorbing at x, there is an e > O such 
that for O<t<e, x+ux=yeEV. But x=(1+12) 'y, ye V. Hence 
p(x)=( +t) ‘p(y stn ' <1. 

Uniqueness follows by (III.1.4). = 


The seminorm p defined in the preceding proposition is called the 
Minkowski functional of V or the gauge of V. 

Note that if 2 is a TVS space and V is an open set in 2%, then V is 
absorbing at each of its points. 

Using Proposition 1.14, the following characterization of a LCS can be 
obtained. The proof is left to the reader. 


1.15. Proposition. Let & be a TVS and let W be the collection of all open 
convex balanced subsets of &. Then & is locally convex if and only if U is a 
basis for the neighborhood system at 0. 
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EXERCISES 


1. 


13. 


14. 


15. 


Let 2 be a TVS and let W be all the open sets containing 0. Prove the 
following. (a) If U € &, there isa V in Y such thatV+ VC U. (b) If UEY, 
there isa V in Y such that V C U and aV C V for all |a| < 1. (V is balanced.) 
(Hint: If We WY and aW C U for |a| < «, then eW C BU for |B| = 1.) 


. Show that a LCS is a TVS. 
. Suppose that % is a TVS but do not assume that Z% is Hausdorff. (a) Show that 


& is Hausdorff if and only if the singleton set {0} is closed. (b) If % is 
Hausdorff, show that 2% is a regular topological space. 


. Let & be a TVS. Show: (a) if x» € %, the map x > x + x, is a homeomor- 


phism of % onto %; (b) if a € F and a # 0, the map x > ax is a homeomor- 
phism. 


. Prove Proposition 1.4. 


. Verify the statements made in Example 1.5. Show that a net {f,} in C(X) 


converges to f if and only if f, — f uniformly on compact subsets of X. 


. Show that the space H(G) defined in (1.6) is complete. (Every Cauchy net 


converges.) 


. Verify the statements made in Example 1.7. Give a basis for the neighborhood 


system at 0. 


. Verify the statements made in Example 1.8. 
. Prove Proposition 1.9. 


. Show that if A is a convex set and a, B > 0, then aA + BA = (a+ B)A. Give 


an example of a nonconvex set A for which this is untrue. 


. If & is a TVS and A is closed, show that A is convex if and only if 


+(x + y) © A whenever x and y € A. 


Let s = the space of all sequences of scalars. Thus s = all functions x: N > F. 
Define addition and scalar multiplication in the usual way. If x, y © s, define 


— S yen_ Oy) = (| 
MP) BTS p(y) 


Show that d is a metric on s and that with this topology s is a TVS. Also show 
that s is complete. 


Let (X, 2, ) be a finite measure space, let 4% be the space of {2-measurable 
functions, and identify two functions that agree ae. [yw]. If f, g © 4, define 


ed SBN... 
1+|f-al 


Then d is a metric on 4 and (.4, d) is a complete TVS. Is there a relationship 
between this example and the space s of Exercise 13? 


d(f,g)= 


If isa TVS and A C &, then cl A =(/{A + V: 0 € V and V is open}. 


108 IV. Locally Convex Spaces 


16. If & is a TVS and -@ is a closed linear space, then 2/4 with the quotient 
topology is a TVS. If p is aseminorm on 2, define p on 4/4 by p(x + @) = 
inf{ p(x + y): y € M#}. Show that p is a seminorm on 2/4. Show that if 2% is 
a LCS, then so is 2/4. 


17. If {@%: i € 7} is a family of TVS’s, then @=[]1{4;: i © J} with the product 
topology is a TVS. If each %, is a LCS, then so is #. If is a LCS, must each 
#, be a LCS? 


18. If & is a finite-dimensional vector space and 7,,.7, are two topologies on % 
that make % into a TVS, then 7, = 7,. 


19. If & is a TVS and .4 1s a finite-dimensional linear manifold in %, then -@ is 
closed and ¥ + .@ is closed for any closed subspace ¥Y of %. 


20. Let & be any infinite-dimensional vector space and let 7 be the collection of 
all subsets W of & such that if x € W, then there is a convex balanced set U 
with x + UC W and UN 4 open in @ for every finite-dimensional linear 
manifold 4 in &. (Each such -@ is given its usual topology.) Show: (a) (%, 7 ) 
is a LCS; (b) a set F is closed in 2 if and only if F 1-4 1s closed for every 
finite-dimensional subspace 4 of 2%; (c) if Y is a topological space and f: 
#— Y (not necessarily linear), then f is continuous if and only if f|7@ is 
continuous for every finite-dimensional space 4%; (d) if Y is a TVS and 7: 
& — Y is a linear map, then T is continuous. 


21. Let X be a locally compact space and for each ¢ in C)(X), define p,(f) = 
loll. for f in C,(X). Show that p, is a seminorm on C,(X). Let B = the 
topology defined by these seminorms. Show that (C,(X), 8B) is a LCS that is 
complete. B is called the strict topology. 


22. For 0 < p <1, let /? = all sequences x such that L°_,|x(n)/? < oo. Define 
d(x, y) = X%_,|x(n) — y(n)? (no pth root). Then d is a metric and (/’, d) is 
a TVS that is not locally convex. 


23. Let ® and &Y be locally convex spaces and let T: 4— Y be a linear 
transformation. Show that T is continuous if and only if for every continuous 
seminorm p on Y, peT is a continuous seminorm on 2. 


24. Let # be a LCS and let G be an open connected subset of %. Show that G is 
arcwise connected. 


§2. Metrizable and Normable Locally Convex Spaces 


Which LCS’s are metrizable? That is, which have a topology which is 
defined by a metric? Which LCS’s have a topology that is defined by a 
norm? Both are interesting questions and both answers could be useful. 

If A is a family of seminorms on 2% and % is a TVS, say that P 
determines the topology on % if the topology of % is the same as the 
topology induced by F. 
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2.1. Proposition. Let { p,, p,,...} be a sequence of seminorms on % such 
that (\_,{x: p,(x) = 0} = (0). For x andy in 2, define 


Ss nen PAX -Y) 
ney) a? 1+ p,(x —y) 


Then d is a metric on & and the topology on & defined by d is the topology on 
& defined by the seminorms { p,, P2,...}. Thus a LCS is metrizable if and 
only if its topology is determined by a countable family of seminorms. 


PROOF. It is left as an exercise for the reader to show that d is a metric and 
induces the same topology as { p,}. If 2 is a LCS and its topology is 
determined by a countable family of seminorms, it is immediate that 2 is 
metrizable. For the converse, assume that 2 is metrizable and its metric is 
p. Let U, = {x: p(x,0)< 1/n}. Because 2% is locally convex, there are 
continuous seminorms g,,...,g, and positive numbers ¢,,...,€, such that 
A i{x: g(x)<e} CU, If p,=e'q, +--+ +ex'q,, then x € U, 
whenever p,(x) < 1. Clearly, p,, is continuous for each n. Thus if x; > 0 in 
4, then for each n, p,(x,;) > 0 as j — oo. Conversely, suppose that for 
each n, p,(x;) > Oas j > o.If e > 0, let n > e *. Then there is a jy) such 
that for j = jo, p,(x,;) < 1. Thus, for j = jo, x; © U, © (x: p(x, 0) < €}. 
That is, p(x,;,0) < e for j = jp and so x; > 0 in 2. This shows that { p, } 
determines the topology on 2%. (Why?) | 


2.2. Example. If C(X) is as in Example 1.5, then C(X) is metrizable if and 
only if X = U%_,K,, where each K, is compact, K, C K,C --:,andif K 
is any compact subset of X, then K C K, for some n. 


2.3. Example. If X is locally compact and C(X) is as in Example 1.5, then 
C( X) is metrizable if and only if X is o-compact (that is, X is the union of 
a sequence of compact sets). If H(G) is as in Example 1.6, then H(G) is 
metrizable. 


If & is a vector space and d is a metric on 2@, say that d is translation 
invariant if d(x + z, y + z) = d(x, y) for all x, y,z in 2. Note that the 
metric defined by a norm as well as the metric defined in (2.1) are 
translation invariant. 


2.4. Definition. A Frechet space isa TVS 2 whose topology is defined by 
a translation invariant metric d and such that (2%, d) is complete. 


It should be pointed out that some authors include in the definition of a 
Frechet space the assumption that Z is locally convex. 


2.5. Definition. If Z is a TVS and B C @, then B 1s bounded if for every 
open set U containing 0, there is an e > O such that eB C U. 
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If % is a normed space, then it is easy to see that a set B is bounded if 
and only if sup{||b||: 5 © B} < 00, so the definition is intuitively correct. 

Also, notice that if || - || is a norm, { x: ||x|| < 1} 1s itself bounded. This is 
not true for seminorms. For example, if C(IR) is topologized as in (1.5), let 
P(f) = sup{|f(2)|: 0 < ¢ < 1}. Then p is a continuous seminorm. How- 
ever, { f: p(f) < 1} is not bounded. In fact, if fy 1s any function in C(R) 
that vanishes on [0,1], {afp: aE R}C{f: p(f)<1}. The fact that a 
normed space possesses a bounded open set is characteristic. 


2.6. Proposition. Jf 2 is a LCS, then & is normable if and only if & has a 
bounded open set. 


PROOF. It has already been shown that a normed space has a bounded open 
set. So assume that % is a LCS that has a bounded open set U. It must be 
shown that there is norm on @ that defines the same topology. By 
translation, it may be assumed that 0 © U (see Exercise 41). By local 
convexity, there is a continuous seminorm p such that {x: p(x) <1}=V 
Cc U (Why?). It will be shown that p is a norm and defines the topology 
on Z. 

To see that p is a norm, suppose that x © 4, x #0. Let W,W. be 
disjoint open sets such that 0 © W, and x € W.. Then there is an ¢ > 0 
such that W, D eU D eV. But VV = {y: p(y) < €}. Since x € W, p(x) > «. 
Hence p is a norm. 

Because p is continuous on 2, to show that p defines the topology of 2 
it suffices to show that if g is any continuous seminorm on 2%, there is an 
a > 0 such that g < ap (Why?). But because g is continuous, there is an 
e > O such that {x: q(x) < 1} D UD eV. That is, p(x) < € implies g(x) 
<1. By Lemma III.1.4,g<e 'p. & 


EXERCISES 

1. Supply the missing details in the proof of Proposition 2.1. 
2. Verify the statements in Example 2.2. 

3. Verify the statements in Example 2.3. 
4 


. Let & be a TVS and prove the following: (a) If B is a bounded subset of 2, then 
so is cl B. (b) The finite union of bounded sets is bounded. (c) Every compact set 
is bounded. (d) If B C 2, then B is bounded if and only if for every sequence 
{x, } contained in B and for every {a,,} in cy, a,x, > Oin &. (e) If Y isa 
TVS, T: #- Y is a continuous linear transformation, and B is a bounded 
subset of 2, then T(B) is a bounded subset of ¥. (f) If @ isa LCS and Bc 2, 
then B is bounded if and only if for every continuous seminorm p, sup{ p(b): 
b € B} < oo. (g) If & is a normed space and B € &, then B is bounded if and 
only if sup{||b||: 6 & B} < oo. (h) If @ is a Frechet space, then bounded sets 
have finite diameter, but not conversely. (1) The translate of a bounded set is 
bounded. 
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5. If is a LCS, show that 2 is metrizable if and only if 2% is first countable. Is 
this equivalent to saying that {0} 1s a Gs set? 


6. Let X be a locally compact space and give C,(X) the strict topology defined in 
Exercise 1.21. Show that a subset of C,(X) is B-bounded if and only if it is norm 
bounded. 


7. With the notation of Exercise 6, show that (C,(X), 8) is metrizable if and only if 
X is compact. 


8. Prove the Open Mapping Theorem for Frechet spaces. 


§3. Some Geometric Consequences of the 
Hahn-Banach Theorem 


In order to exploit the Hahn—Banach Theorem in the setting of a LCS, it is 
necessary to establish some properties of continuous linear functionals. The 
proofs of the relevant propositions are similar to the proofs of the corre- 
sponding facts about linear functionals on normed spaces given in §III.5. 
For example, a hyperplane in a TVS is either closed or dense (see III.5.2). 
The proof of the next fact is similar to the proof of (III.2.1) and CIII.5.3) 
and will not be given. 


3.1. Theorem. Jf % is a TVS and f: 2- F is a linear functional, then the 
following statements are equivalent. 


(a) f is continuous. 

(b) fis continuous at 0. 

(c) f is continuous at some point. 
(d) ker f is closed. 


(e) x > |f(x)| is a continuous seminorm. 


If & isa LCS and F is a family of seminorms that defines the topology on 
&, then the statements above are equivalent to the following: 


(f) There are p,,...,p, in FY and positive scalars a,,...,a, such that 


f(x)| S Lp a1%% P(X) for all x. 


The proof of the next proposition is similar to the proof of Proposition 
1.14 and will not be given. 


3.2. Proposition. Let be a TVS and suppose that G is an open convex 
subset of & that contains the origin. If 


q(x) =inf{t:t> Oand x € 1G}, 


then q is a non-negative continuous sublinear functional and G = { x: q(x) < 


1}. 
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Note that the difference between the preceding proposition and (1.14) is 
that here G is not assumed to be balanced and the consequence is a 
sublinear functional (q(ax) = ag(x) if a => 0) that is not necessarily a 
seminorm. 

The geometric consequences of the Hahn—Banach Theorem are achieved 
by interpreting that theorem in light of the correspondence between linear 
functionals and hyperplanes and between sublinear functionals and open 
convex neighborhoods of the origin. The next result is typical. 


3.3. Theorem. Jf 2 is a TVS and G is an open convex nonempty subset of & 
that does not contain the origin, then there is a closed hyperplane 4 such that 
MAG =O. 


PrRooF. Case J]. & is an R-linear space. Pick any x, in G and let 
H = x. — G. Then H is an open convex set containing 0. (Verify). By (3.2) 
there is a continuous nonnegative sublinear functional g: 2— R such that 
H = {x: q(x) < 1}. Since x) € H, q(xo) = 1. 

Let Y= {ax,: a © R} and define fp: Y> R by fo(ax,) = ag(xo). If 
a > 0, then f(ax,.) = aq(x,) = q(axy); if a < 0, then f(ax,.) = aq(xo) < 
a<0< q(ax,). So fo < qon®. Let f: 2— R be a linear functional such 
that f|[Y= fp and f<qon 2%. Put Y= kerf. 

Now if x © G, then x, —- x © H and so f(x,) — f(x) = f(x) - x) < 
q(X9 — x) < 1. Therefore f(x) > f(x9) — 1 = q(x9) — 1 = 0 for all x in 
G. Thus 40 G=0. 

Case 2. & is a C-linear space. Lemma III.6.3 will be used here. Using 
Case 1 and the fact that % is also an R-linear space, there is a continuous 
R-linear functional f: 2— R such that GN kerf=O. If F(x) = f(x) - 
if(ix), then F is a C-linear functional and f= ReF (III.6.3). Hence 
F(x) = 0 if and only if f(x) = f(ix) = 0; that is, @W= kerF = kerfo 
[i ker f]. So .@ is a closed hyperplane and 41 G =O. = 


An affine hyperplane in @ is a set -@ such that for every x, in .4, 
M — x, is a hyperplane. (See Exercise 3.) An affine manifold in % isa set Y 
such that for every x) in Y, Y— x, is a linear manifold in Z. An affine 
subspace of a TVS & is a closed affine manifold. 


3.4. Corollary. Let be a TVS and let G be an open convex nomempty 
subset of &. If Y is an affine subspace of & such that YO G = (, then there 
is a closed affine hyperplane M in & such that YC HM and MNVG=0. 


PROOF. By considering G — x, and &— x, for any x, in Y, it may be 
assumed that ¥Y is a linear subspace of 2. Let 0: % —~ 2/¥Y be the natural 
map. Since Q~'(O(G))={y+G: y]¥Y}, Q(G) is open in 2/Y. It is 
easy to see that Q(G) is also convex. Since ¥N G =O, 0 € Q(G). By the 
preceding theorem, there is a closed hyperplane WY in 2/¥Y such that 
NO O(G)=0. Let W=Q°1(/). It is easy to check that .# has the 
desired properties. ia 
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There is a great advantage inherent in a geometric discussion of real 
TVS’s. Namely, if f: 2— R is a nonzero continuous R-linear functional, 
then the hyperplane ker f disconnects the space. That is, 2\ kerf has two 
components (see Exercises 4 and 5). It thus becomes convenient to make the 
following definitions. 


3.5. Definition. Let 2 be a real TVS. A subset S of % is called an open 
half-space if there is a continuous linear functional f: 2@— R such that 
S={x eZ: f(x)> a} for some a. S is a closed half-space if there is a 
continuous linear functional f/f: 2— R such that S= {x © 2%: f(x) => a} 
for some a. 


Two subsets A and B of & are said to be strictly separated if they are 
contained in disjoint open half-spaces; they are separated if they are 
contained in two closed half-spaces whose intersection is a closed affine 
hyperplane. 


3.6. Proposition. Let Z be a real TVS. 


(a) The closure of an open half-space is a closed half-space and the interior of 
a closed half-space is an open half-space. 

(b) If A, B C &, then A and B are strictly separated (separated ) if and only 
if there is a continuous linear functional f: 2 — R and a real scalar a such 
that f(a) > a for allain A and f(b) < a for all b in B (f(a) = a@ for all 
ain A and f(b) < a for all b in B). 


PROOF. Exercise 6. 


In many ways, the next result is the most important “separation” theorem 
as the other separation theorems follow from this one. However, the most 
used separation theorem is Theorem 3.9 below. 


3.7. Theorem. Jf % is a real TVS and A and B are disjoint open convex 
subsets of &, then A and B are strictly separated. 


ProoF. Let G=A-B={a-—b: a€ A,b€ B}; it is easy to verify that 
G is convex (do it!). Also, G = U{A — b: b © B}, so G is open. Moreover, 
because A ™ B = D0, 0 € G. By Theorem 3.3 there is a closed hyperplane .Z 
in & such that #41 G =O. Let f: 2-R bea linear functional such that 
M = kerf. Now f(G) is a convex subset of R and 0 € f(G). Hence either 
f(x) > 0 for all x in G or f(x) < 0 for all x in G; suppose f(x) > 0 for all 
x in G. Thus if a€ A and bE B, 0 < f(a — b) = f(a) — f(b); that is, 
f(a) > f(b). Therefore there is a real number a such that 


sup{ f(b): b€ B} <a<inf{ f(a): a€ A}. 


But f(A) and f(B) are open intervals (Exercise 7), so f< aon B and 
f>aon A. = 
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3.8. Lemma. If Z is a TVS, K is a compact subset of 2, and V is an open 
subset of & such that K C V, then there is an open neighborhood of 0, U, such 
that K+ UC V. 


Proor. Let %, = all of the open neighborhoods of 0. Suppose that for each 
U in Y%, K + U is not contained in V. Thus, for each U in %, there is a 
vector x, in K and a y,, in U such that x, + yy © ¥\ V. Order Y, _ by 
reverse inclusion; that is, U, => U, if U, C U,. Then %, is a directed set and 
{x,} and { y,,} are nets. Now y, > 0 in 2. Because K is compact there is 
an x in K such that x,-grx ({ xy } clusters at x). Hence x,, + yy-grx + 0 
= x. (Why?) Hence x € ch(%@\ V) = 2\ V, a contradiction. 5 


The condition that K be compact in the preceding lemma is necessary; it 
is not enough to assume that K is closed. (What is counterexample?) 


3.9. Theorem. Let % be a real LCS and let A and B be two disjoint closed 
convex subsets of &. If B is compact, then A and B are strictly separated. 


PRooF. By hypothesis, B is a compact subset of the open set 2@\ A. The 
preceding lemma implies there is an open neighborhood U, of 0 such that 
B+ U,C2%\A. Because % is locally convex, there is a continuous semi- 
norm p on & such that {x: p(x) <1} C U,. Put U= {x: p(x) < 4}. 
Then (B+ U)N(A + U) =O (Verify!), and A + U and B+ U are open 
convex subsets of # that contain A and B, respectively. (Why?) So the 
result follows from Theorem 3.7. ai 


The fact that one of the two closed convex sets in the preceding theorem 
is assumed to be compact is necessary. In fact, if = R?, A = {(x, y) © R?: 
y <0}, and B= {(x,y)€R*: y>x~*}, then A and B are disjoint 
closed convex subsets of R* that cannot be strictly separated. 

The next result generalizes Corollary III.6.8, though, of course, the metric 
content of (III.6.8) is missing. 


3.10. Corollary. Jf & is a real LCS, A is a closed convex subset of X, and 
x € A, then x is strictly separated from A. 


3.11. Corollary. If & is a real LCS and AC, then co(A) is the 
intersection of the closed half-spaces containing A. 


Proor. Let # be the collection of all closed half-spaces containing 4. 
Since each set in # is closed and convex, co(A) CH: H © #}. On the 
other hand, if x, € co(A), then (3.10) implies there is a continuous linear 
functional f: 2— R and an a in R such that f(x.) > a and f(x) < a for 
all x in co(A). Thus H = {x: f(x) < a} belongs to # and x, € H. fe 


The next result generalizes Theorem III.6.13. 
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3.12. Corollary. If Z is a real LCS and A C &, then the closed linear span 
of A is the intersection of all closed hyperplanes containing A. 


If Z is a complex LCS, it is also a real LCS. This can be used to 
formulate and prove versions of the preceding results. As a sample, the 
following complex version of Theorem 3.9 is presented. 


3.13. Theorem. Let % be a complex LCS and let A and B be two disjoint 
closed convex subsets of &. If B is compact, then there is a continuous linear 
functional f: # > C, an ain R, and an « > 0 such that for ain A and b in B, 


Ref(a)<a<a+te<Ref(b). 


3.14. Corollary. If & is a LCS and ¥Y is a linear manifold in &, then Y is 
dense in & if and only if the only continuous linear functional on & that 
vanishes on Y is the identically zero functional. 


3.15. Corollary. If % is a LCS, Y is a closed linear subspace of 2, and 
Xo © F\ Y, then there is a continuous linear functional f: &— F such that 
f(y) = 0 for all y in Y and f(x) = 1. 


These results imply that on a LCS there are many continuous linear 
functionals. Compare the results of this section with those of §III.6. 

The hypothesis that 2 is locally convex does not appear in the results 
prior to Theorem 3.9. The reason for this is that in the preceding results the 
existence of an open convex subset of % is assumed. In Theorem 3.9 such a 
set must be manufactured. Without the hypothesis of local convexity it may 
be that the only open convex sets are the whole space itself and the empty 
set. 


3.16. Example. For 0 < p < 1, let L (0,1) be the collection of equivalence 
classes of measurable functions f: (0,1) — R such that 


(f)),= [Uo ax < 00. 


It will be shown that d(f, g) = ((f— g)), is a metric on L?(0,1) and that 
with this metric L (0,1) is a Frechet space. It will also be shown, however, 
that L?(0,1) has only one nonempty open convex set, namely itself. So 
L?(0,1), 0 < p < 1, is most emphatically not locally convex. The proof of 
these facts begins with the following inequality. 


3.17 For s,tin [0,00) andO <p <1,(st+t)? <s?4+12?. 
To see this, let f(t)=s?+t?—(s+t)? for t>0, s fixed. Then 


f(t) = pt?-* — p(s + t)?*. Since p-—1<0 and s+it>t, f(t)> 0. 
Thus 0 = f(0) < f(t). This proves (3.17). 
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If d(f,g)=(f- g)), for f,g in L?(0,1), then (3.17) implies that 
d(f,g)<d(f,h) + d(h, g) for all f, g,h in L?(0,1). It follows that d is a 
metric on L (0,1). Clearly d is translation invariant. 


3.18 L?(0,1), 0 < p < 1, is complete. 
The proof of this is left as an exercise. 
3.19 L?(0,1) isa TVS. 


The continuity of addition is a direct consequence of the translation in- 
variance of d. If f, > fanda, > a,a, nF, d(a,f,,af) = (a,f, — af)), 
< (anf, — &,f))p + (anf — Of) = lel’, — fp + le, — aL), 
< CUS, —f)), + la, — a?(f)),, where C is a constant independent of n. 
Hence a, f,, ~ af. Thus L (0,1) is a Frechet space. 

3.20 If G is a nonempty open convex subset of L?(0,1),then 
G = L?(0,1). 

To see this, first suppose f € L?(0,1) and ((f)), =r < R. As a function 
of t, {5|f(x)|? dx is continuous, assumes the value 0 at ¢ = 0, and assumes 
the value r at ¢=1. Let 0 <¢ <1 such that [o|f(x)|?dx =r/2. Define 
g,h: (0,1) > F by g(x) = f(x) for x < t and 0 otherwise; h(x) = f(x) for 
x > t and 0 otherwise. Now f= g + h = 3(2g + 2h) and ((2g)), = ((2h)), 
= 2?(r/2) = r/2' ”. Hence f € coB(0O; R/2!~”). This implies that 
B(O; R) € co B(0; R/2!~”), or, equivalently, B(0;2'~’R) € coB(0, R). 
Hence B(0; 4'~?R) € co B(0;2'~?R) € co B(0; R). Continuing we see that 
for all n, B(O;2""°-”R) C coB(O; R). 

So if G is a nonempty open convex subset of L’(0, 1), then by translation 
it may be assumed that 0 € G. Thus there is an R > 0 with B(O; R)C G. 
By the preceding paragraph, B(0;2”"~”R) C coB(0O; R) C G forall n > 1. 
Therefore L?(0,1) C G. 

Also note that this says that the only continuous linear functional on 
L?(0,1), 0 < p < 1, is the identically zero functional. 


EXERCISES 
1. Prove Theorem 3.1. 


2. Let p be a sublinear functional, G = {x: p(x) < 1}, and define the sublinear 
functional g for the set G as in Proposition 3.2. Show that g(x) = max( p(x), 0) 
for all x in 2. 


3. Let 4 C #,a TVS, and show that the following statements are equivalent: (a) 
M is an affine hyperplane; (b) there exists an x, in @ such that W— x, isa 
hyperplane; (c) there is a linear function f: 2— F and an a in F such that 
M={x Ed: f(x) = a}. 


4. Let & be a real TVS. Show: (a) if G is an open connected subset of %, then G 
is arcwise connected; (b) if f: 2— R is a continuous linear functional, then 
#\ kerf has two components, { x: f(x) > 0} and {x: f(x) < 0}. 


5. If & is a complex TVS and f: 2-— C is a nonzero continuous linear function, 
show that 2 \ kerf is connected. 
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6. Prove Proposition 3.6. 


7. If f: #— R is acontinuous R-linear functional and A is an open convex subset 
of #, then f(A) is an open interval. 


8. Prove Corollary 3.12. 

9. Prove Theorem 3.13. 
10. State and prove a version of Theorem 3.7 for a complex TVS. 
11. State and prove a version of Corollary 3.11 for a complex LCS. 
12. State and prove a version of Corollary 3.12 for a complex LCS. 
13. Prove (3.18). 


14. Give an example of a TVS & that is not locally convex and a subspace Y of # 
such that there is a continuous linear functional f on Y with no continuous 
extension to 2. 


§4*. Some Examples of the Dual Space of a Locally 
Convex Space 


As with a normed space, if @ is a LCS, #* denotes the space of all 
continuous linear functionals f: #— F. %* is called the dual space of &. 


4.1. Proposition. Let X be completely regular and let C( X ) be topologized as 
in Example 1.5. If L: C(X) — F is a continuous linear functional, then there 
is a compact set K and a regular Borel measure 1. on K such that L¢ f) = fefdp 
for every f in C(X). Conversely, each such measure defines an element of 
C(X)*. 

PROOF. It is easy to see that each measure pp supported on a compact set K 
defines an element of C(X)*. In fact, if p,(f) = sup{|f(x)|: x © K } and 
L(f) = fxfdp, then |L(f)| < |lullpx(f), and so L is continuous. 

Now assume L © C(X)*. There are compact sets K,,..., K, and posi- 
tive numbers a,,...,a, such that |L(f)| < imi, PKS) (3.1f). Let K = 
U"_,K, and a = max{|a,|: 1 <j <n}. Then |L(f)| < ap,(f). Hence if 
f<=C(X) and f\K = 0, then L(f) = 0. 

Define F: C(K ) > F as follows. If g € C(K), let & be any continuous 
extension of g to X and put F(g) = L(g). To check that F is well defined, 
suppose that g, and g, are both extensions of g to X. Then g, — g, = Oon 
K, and hence L(g,) = L(g,). Thus F is well defined. It is left as an exercise 
for the reader to show that F: C(K) - F is linear. If g € C(K) and ¢ is 
an extension in C(X), then |F(g)| = |L(g)| < ap,(£) = allg||, where the 
norm is the norm of C(K ). By (III.5.7) there is a measure p in M(K ) such 
that F(g) = {,gdp. If fe CCX), then g = f\K € C(K) and so L(f)= 
F(g)=fxfdp. 
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Let C,, denote the extended complex plane. Thus C,, = C U {00} with 
the metric it obtains from its identification with the sphere. If y: [0,1] > C 
is a rectifiable curve and f is a continuous function defined on the trace of 
y, y((0,1]), then f,f is the line integral of f over y. That is, [,/f= 

of(y(t)) dy(t). (See Conway [1978].) The next result generalizes to arbi- 
trary regions in the plane, but for simplicity it is stated only for the disk D. 
Recall the definition of H(D) from Example 1.6. 


4.2. Proposition. L © H(D)* if and only if there is an r < 1 and a unique 
function g analytic on C ,, \ B(O; r) with g(co) = 0 such that 


43 L(t) = = ffs 


for every f in H(D), where y(t) = pe",0 <t < 2a, andr<p <1. 
Proor. Let g be given and define L as in (4.3). If K = {z: |z| = p}, then 


L(A = se] [P(e 8(pe")ine" at 


= == Px (f) Px (g)2"0. 


So if ¢ = ppx(g), IL(f)| < px(f), and L © H(D)*. 
Now assume that L € H(D)*. The Hahn—Banach Theorem implies there 


is an F in C(D)* such that F|H(D) = L. By Proposition 4.1 there is a 
compact set K contained in D and a measure uw on K such that L(f) = 
{-fdp for every f in A(D). Define g: C,\K—>C by g(o)=0 and 
g(z) = —fel/(w — z)dp(w) for z in C\ K. By Lemma III.8.2, g 1s 
analytic on C,,\ K. Let p <1 such that K C B(0;p). If y(t) = pe”, 
0 <t < 2a, then Cauchy’s Integral Formula implies 


1 ¢ £2) 
i= rk emer 
for |w| < p; in particular, this is true for w in K. Thus, 


L(f)= ffl) du(w) 
=f fee erat |aucw) 


ee it\ jit 1 
= 37 [ “i (ve"e ii in(w) | 


— W 


- ai | f2)8(2) de 


This completes the proof except for the uniqueness of g (Exercise 3). a 
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EXERCISES 


1. Let {%;: i © I} be a family of LCS’s and give =[1{%,: i © 7} the product 
topology. (See Exercise 1.17.) Show that L © 2* if and only if there is a finite 
subset F contained in J and there are x* in 2* for j in F such that 
L(x) = Lijec px*(x(J/)) for each x in &. 


2. Show that the space s (Exercise 1.13) is linearly homeomorphic to C(N) and 
describe s*. 


3. Show that the function g obtained in Proposition 4.2 is unique. 


4. Show that L © AH(D)* if and only if there are scalars by, b,,... in C such that 
lim sup [,|'/" < Land L(f) = 2% 91 /(n) f Ob,. 


5. If G is an annulus, describe H(G)*. 


6. (Buck [1958]). Let X be locally compact and let B be the strict topology on 
C,(X) defined in Exercise 1.21. (Also see Exercises 2.6 and 2.7.) Prove the 
following statements: (a) If » © M(X) and e, 10, then there are compact sets 
K,, K,,... such that foreach n > 1, K, C int K,,, and |u|(X\ K,,) < «,. (b) If 
pw © M(X), then there is a @ in Co(X) such that $ > 0, |wi(X\ {x: o(x) > 0}) 
= 0,1/¢ € L'(|y]), and {1/¢ djp| < 1. (c) Show that if » € M(X) and L(f) = 
/fdp for f in C,(X), then L € (C,(X), B)*. (d) Conversely, if L € (C,(X), B)*, 
then there is a wp in M(X) such that L(f) = {fdp for f in C,(X). 


7. Let X be completely regular and let 4 be a linear manifold in C( X). Show that 
if for every compact subset K of X, @|K={f|K: f ©} is dense in C(K), 
then -@ is dense in C(X). 


§5*. Inductive Limits and the Space of Distributions 


In this section the most general definition of an inductive limit will not be 
presented. Rather one that removes certain technicalities from the argu- 
ments and yet covers the most important examples will be given. For the 
more general definition see Kdéthe [1969], Robertson and Robertson [1966], 
or Schaefer [1971]. 


5.1. Definition. An inductive system is a pair (2%, { %,;: i © I}), where % is 
a vector space, %, is a linear manifold in 2 that has a topology 7, such 
that (%,, 7,) is a LCS, and, moreover: 


(a) J is a directed set and 2, Cc x if i<j; 
(b) ifi<jandU € 7, thnU 2% € 7; 
(c) ZH=UlZ 7671}: 


Note that condition (b) is equivalent to the condition that the inclusion 
map %, > %, is continuous. 


t 
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5.2. Example. Let d > 1 and let 2 be an open subset of R%. Denote by 
C‘*)(2) all the functions @: 2 — F such that ¢ is infinitely differentiable 
and has compact support in (2. (The support of ¢ is defined by spt ¢ = cl{ x: 
(x) # 0}.) If K is a compact subset of 02, define D(K) = {6 © C%(&2): 
spto@ C K }. Let D(K) have the topology defined by the seminorms 


Px,m( >) = sup{|o(x)|: |k| < m, x © K }, 
where k = (k,,...,kz), k; © N U {O}, |k| =k, +--+ +k, and 


Then (C°(Q), {G(K): K is compact in 2}) is an inductive system. The 
space C°°({2) is often denoted in the literature by Z(S2), as it will be in this 
book. 

This example of an inductive system is the most important one as it is 
connected with the theory of distributions (below). In fact, this example was 
the inspiration for the definition of an inductive limit given now. 


5.3. Proposition. /f (%,{%,,.7,}) is an inductive system, let B = all con- 
vex balanced sets V such thatV\ %,€ 7, for alli. Let 7 = the collection of 
all subsets U of & such that for every x, in U there is a V in @ with 
Xp + VC U. Then (2,7) is a (not necessarily Hausdorff) LCS. 


Before proving this proposition, it seems appropriate to make the follow- 
ing definition. 


5.4. Definition. If (2%, { %;}) is an inductive system and .7 is the topology 
defined in (5.3), 7 is called the inductive limit topology and (2%, 7 ) is said 
to be the inductive limit of { %; }. 


5.5. Lemma. With the notation as in (5.3), BCT. 


PrRoor. Fix V in @&. It will be shown that V is absorbing at each of its 
points. Indeed, if xy» © V and x € &, then there is an 2, and an 2, such 


that x, © 4%, and x © %. Since J is directed. there is a k in J with 
k >i, j. Hence x), x € %,. BurVN 2%, © Z,. Thus there is an e > 0 such 
that x, tax ©€VN24,C V for ja| < «. 

Since V is convex, balanced, and absorbing at each of its points, there is a 
seminorm p on # such that VV = {x © #: p(x) < 1} (1.14). So if x9 € V, 
P(X))=%< 1. Lett W={x Ee: p(x) < 30 — 7)}. Then W= 30 - 
ry V andsoWe &. Sincexg +WOV, VET. i 


PROOF OF PROPOSITION 5.3. The proof that 7 is a topology is left as an 
exercise. To see that (%, 7) is a LCS, note that Lemma 5.5 and Theorem 
1.14 imply that 7 is defined by a family of seminorms. a 
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For all we know the inductive limit topology may be trivial. However, the 
fact that this topology has not been shown to be Hausdorff need not 
concern us, since we will concentrate on a particular type of inductive limit 
which will be shown to be Hausdorff. But for the moment we will continue 
at the present level of generality. 


5.6. Proposition. Let (%,{%,}) be an inductive system and let Z be the 
inductive limit topology. Then 


(a) the relative topology on &, induced by J (viz., Z\%,) is smaller than 7,; 

(b) if Y is a locally convex topology on & such that for every i, UX, < TJ, 
then UCT; 

(c) a seminorm p on & is continuous if and only if p\|%, is continuous for 


each i. 


PROOF. Exercise 3. 


5.7. Proposition. Let (2%, 7) be the inductive limit of the spaces {(2,, 7;): 
iG I}. If Y isa LCS and T: > Y is a linear transformation, then T is 
continuous if and only if the restriction of T to each 2, is 7-continuous. 


I 


PROOF. Suppose 7: %-— ¥Y is continuous. By (5.6a), the inclusion map 
(%,,7,) > (#, J) is continuous. Since the restriction of T to %, is the 
composition of the inclusion map %, > % and T, the restriction is continu- 
ous. 

Now assume that each restriction is continuous. If p is a continuous 
seminorm on &Y, then p°7|%, is a 7-continuous seminorm for every i. By 


(5.6c), p°T is continuous on 2. By Exercise 1.23, T is continuous. i 


It may have occurred to the reader that the definition of the inductive 
limit topology depends on the choice of the spaces 2%, in more than the 
obvious way. That is, if 2=U,Y and each ¥Y has a topology that is 
“compatible” with that of the spaces {%,}, perhaps the inductive limit 
topology defined by the spaces {Y,} will differ from that defined by the 
{%,}. This is not the case. 


5.8. Proposition. Let (2, ((%;,.7;)}) and (%,(Y,¥%,)}) be two inductive 
systems and let J and Y be the corresponding inductive limit topologies on &. 
If for every i there is aj such that ¥,;C Y, and U\Z,C 7F,, thenUCT. 


PROOF. Let V be a convex balanced subset of % such that for every /, 
VAY EU. If 2; is given, let j be such that 2;C Y and U|%; C 7, 


Hence VN 2,=(VO¥)N4,e7,. Thus V € @ [as defined in (5.3)}. It 
now follows that YC 7. a 


5.9. Example. Let % be any vector space and let {%;,: i © J} be all of the 
finite-dimensional subspaces of 4. Give each 2%, the unique topology from 


I 
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its identification with a Euclidean space. Then (%, { %;}) is an inductive 
system. Let 7 be the inductive limit topology. If ¥ isa LCS and 7: > Y 
is a linear transformation, then T is #continuous. 


5.10. Example. Let X be a locally compact space and let {K;: ie I} be 
the collection of all compact subsets of X. Let 2; = all f in C(X) such that 
spt f C K,. Then U,%, = C,(X), the continuous functions on X with com- 
pact support. Topologize each 2, by giving it the supremum norm. Then 
(C.(X), {%,}) is an inductive system. 


Let U; be the open subsets of X such that clU;, is compact. Let C(U;) be 
the continuous functions on U; vanishing at oo with the supremum norm. If 
feC(U,) and f is defined on X by letting it be identically 0 on X\ U, 
then f € C,(X). Thus (C,(X), (Co(U,)}) is an inductive system. Proposition 
5.8 implies that these two inductive systems define the same inductive limit 
topology on C.( X). 


5.11. Example. Let d>1 and put K,={x ER“: ||x|| <n}. Then 
(D(R“), {D(K,,)}%,) is an inductive system. By (5.9), the inductive limit 
topology defined on QZ(IR“) by this system equals the inductive limit 
topology defined by the system given in Example 5.2. 


If Q is any open subset of R%, then 2 can be written as the union of a 
sequence of compact subsets {K,,} such that K, ¢ int K,,,. It follows by 
(5.9) that {D(K,,)} defines the same topology on Y({2) as was defined in 
Example 5.2. 

The preceding example inspires the following definition. 


5.12. Definition. A_ strict inductive system is an inductive system 
(%,{2%,, F,}°%_,) such that for everyn > 1, %, C441, F411%, =F, and 
& is closed in 2%, ,,. The inductive limit topology defined on % by such a 
system is called a strict inductive limit topology and & is said to be the strict 


inductive limit of { %,,}. 


Example 5.11 shows that DR“), indeed D(), is a strict inductive limit. 
The following lemma is useful in the study of strict inductive limits as 
well as in other situations. 


§.13. Proposition. If 2 is a LCS, ¥< %, and p is a continuous seminorm 
on Y, then there is a continuous seminorm p on & such that p\Y = p. 


ProoFr. Let U = {y © &: p(y) < 1}. So U 1s open in Y; hence there is an 
open subset V, of & such that V, 1 Y= U. Since 0 € V, and & is a LCS, 
there is an open convex balanced set V in 2 such that V C V,. Let g = the 
gauge of V. Soif y € Y and q(y) < 1, then p(y) < 1. By Lemma III.1.4, 
psy. 
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Let W=co(U U V); it is easy to see that W is convex and balanced 
since both U and V are. It will be shown that W is open. First observe that 
W={tuut+(1-t)u:i0<t<1l,ueU,vu € V} (verify). Hence W = U{tU 
+1 -t)V: O<t<1}. Put W=tU+(1—- tv. So W= V, which is 
open. If0 <4 <1, W,=U{tu+ (1 — t)V: u © V}, and hence is open. But 
W, = U, which is not open. However, if u © U, then there is an e > 0 such 
that eu © V. For 0 <t<1, let y=¢‘'[l-—e+telu (© Y). Ast], 
y, 2 u. Since U is open in Y, there is a t, 0 <¢ <1, with y, in U. Thus 
u = ty,+ (1 — t)(eu) € W,. Therefore W=U{W,: 0<1<1} and W is 
open. 


5.14. Clam. WO Y= U. 


Infact, UC W,soU CWO @WIf wE WO Y, then w = tu+ (1 — t)v, 
uinU,vinV,0 < t < 1; it may be assumed that 0 < ¢t < 1. (Why?) Hence, 
v=(1-t) (w-weEeY.SovE VN YE U; hence we U. 

Let p = the gauge of W. By Clam5.14,{yeE®%: piy)< 1s ={yeE@: 
p(y) < 1}. By the uniqueness of the gauge, p|\Y= p. a 


5.15. Corollary. If 2 is the strict inductive limit of { %,\, k is fixed, and p, 
is a continuous seminorm on 2, then there is a continuous seminorm p on & 
such that p|2, = p,. In particular, the inductive limit topology is Hausdorff. 


PRooF. By (5.13) and induction, for every integer n > k, there is a continu- 
ous seminorm p, such that p,|%_, = p,_;. If x © %, define p(x) = p,(x) 
when x € &,. Since 2, € %,,, forall n, the properties of { p,} insure that 
p is well defined. Clearly p is a seminorm and by (5.6c) p is continuous. 

If x © & and x # 0, there is a k > 1 such that x € #,. Thus there is a 
continuous seminorm p, on %, such that p,(x) # 0. Using the first part 
of the corollary, we get a continuous seminorm p on & such that p(x) # 0. 


Thus (2, 7 ) is Hausdorff. a 


5.16. Proposition. Let 2 be the strict inductive limit of { %,,\. A subset B of 
X is bounded if and only if there is ann > 1 such that BC &, and B is 
bounded in 2. 


The proof will be accomplished only after a few preliminaries are settled. 
Before doing this, here are a few consequences of (5.16). 


5.17. Corollary. If % is the strict inductive limit of {2,,\, then a subset K of 
X is compact if and only if there is ann >1 such that K C2, and K is 
compact in 2). 


5.18. Corollary. [f & is the strict inductive limit of Frechet spaces {%,,\, Y 
is a LCS, and T: &- ¥Y is a linear transformation, then T is continuous if 
and only if T is sequentially continuous. - 
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PRooF. By Proposition 5.7, T is continuous if and only if 7|%,, is continu- 
ous for every n. Since each %, is metrizable, the result follows. a 


Note that using Example 5.11 it follows that for an open subset 2 of R%, 
GQ (82) is the strict inductive limit of Frechet spaces [each Y(K,,) is a Frechet 
space by Proposition 2.1]. So (5.18) applies. 


5.19. Definition. If 2 is an open subset of R%, a distribution on Q is a 
continuous linear functional on Z({2). 


Distributions are, in a certain sense, generalizations of the concept of 
function as the following example illustrates. 


5.20. Example. Let f be a Lebesgue measurable function on {2 that is 
locally integrable (that is, /{,|f|dA < oo for every compact subset K of 
(2—here X is d-dimensional Lebesgue measure). If L;: D(§2) > F is defined 
by L;(¢) = ffod, L, is a distribution. 


From Corollary 5.18 we arrive at the following. 


5.21. Proposition. A linear functional L: 9(Q) —> F is a distribution if and 
only if for every sequence {,} in B(Q) such that cl[U%_,spt¢,] = K is 
compact in Q and (x) — 0 uniformly on K as n> © for every k = 
(k,,...,k,), it follows that L(¢,) > 0. 


Proposition 5.21 is usually taken as the definition of a distribution in 
books on differential equations. There is the advantage that (5.21) can be 
understood with no knowledge of locally convex spaces and inductive limits. 
Moreover, most theorems on distributions can be proved by using (5.21). 
However, the realization that a distribution is precisely a continuous linear 
functional on a LCS contributes more than cultural edification. This knowl- 
edge brings power as it enables you to apply the theory of LCS’s (including 
the Hahn-Banach Theorem). 

The exercises contain more results on distributions, but now we must 
return to the proof of Proposition 5.16. To do this the idea of a topological 
complement is needed. We have seen this idea in Section IJI.13. 


5.22. Proposition. [f 2 isa TVS and Y< 2, the following statements are 
equivalent. 


(a) There is a closed linear subspace & of 2 such that YQ &= (QO), 
Y+ F=X, and the map of ¥X Z-¢E given by (y,z)?~yt+2Zzisa 
homeomorphism. 

(b) There is a continuous linear map P: #—> 2% such that PE = Y and 
pe =P, 
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PROOF. (a) => (b): Define P: 2—> 2 by P(y + z)= y, for y in Y and z in 
Z. It is easy to verify that P is linear and P= Y. Also, P*(y + z) = 
PP(y+z)= Py=y=P(y +z); so P?=P. If {y,+z,} is a net in & 
such that y, + z; > y + z, then (a) implies that y, > y (and z, — z). Hence 
P(y, + z;) > P(y + z) and P is continuous. 

(b) = (a): If P is given, let 2= ker P. So ®%< &. Also, x = Px + (x - 
Px) and y = Px € Y, and z = x — Px has Pz = Px — P’x = Px — Px = 
0,soze&. Thus, ¥+ £=2. lf x Ee YNZ, then Px = 0 since x € &; 
but also x = Pw for some w in Z since x € Y= P&. Therefore 0 = Px = 
P*w = Pw = x; that is, YM X= (0). Now suppose that { y,} and {z,} are 
nets in Y and 2. If y,— y and z,—>z, then y,+ z,> y+ z because 
addition is continuous. If, on the other hand, it is assumed that y, + z,; > y 
+ z, then y = P(y + z)= lim P(y, + z;) = limy, and z, = (y, + z,) —), 
— z. This proves (a). a 


5.23. Definition. If 2 is a TVS and Y¥< %, ¥Y is topologically comple- 
mented in & if either (a) or (b) of (5.22) 1s satisfied. 


5.24. Proposition. Jf Z isa LCS and Y< & such that either dimY < © or 
dim 2/Y < oo, then ¥ is topologically complemented in &. 


PROOF. The proof will only be sketched. The reader is asked to supply the 
details (Exercise 9). 

(a) Suppose d = dim¥Y< oo and let y,,..., y, be a basis for Y. By the 
Hahn-Banach Theorem (III.6.6), there are f,,...,f, in %* such that 
fi(y,) = lif i = 7 and 0 otherwise. Define Px = en (Xx)y;. 

(b) Suppose d = dim 2/Y¥< 0, OQ: > 42/Y is the natural map, and 
Z1,---,Zy&& such that Q(z,),...,Q(z,) is a basis for 2/Y. Let #= 
VEZ nson ea) a 


PROOF OF PROPOSITION 5.16. Suppose 2% is the strict inductive limit of 
{(%,,, F,}) and B is a bounded subset of %. It must be shown that there is 
an n such that BC 2%, (the rest of the proof is easy). Suppose this is not 
the case. By replacing { %,,} by a subsequence if necessary, it follows that 
for each n there is an x, in B\ &,. Let p, be a continuous seminorm on 
4, such that p,(x,) = 1. 


5.25. Claim. For every n > 2 there is a continuous seminorm p, on 2%, 
such that p,(x,) =n and p,|%,_, = p,-1- 


The proof of (5.25) is by induction. Suppose p,, is given and let Y= 2, V 
{X,41}- By (5.24), 4, and V{x,,,} are topologically complementary in %. 
Define gq: ¥ > [0, 00) by g(x + ax,,,) = p,(x) + (n + 1)lal, where x © Z, 
and a © F. Then q is a continuous seminorm on (¥Y, 7 ,,|¥). (Verify!) By 
Proposition 5.13 there is a continuous seminorm p,,, on %,,, such that 
Pai ilY = q. Thus p,.,|%, =p, and p,.i(*%,41;) =n + 1. This proves the 
claim. 
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Now define p: 2- [0, 00) by p(x) = p,(x) if x © Z,. By (5.25), p is 
well defined. It is easy to see that p is a continuous seminorm. However, 
sup{ p(x): x © B} = o, so B is not bounded (Exercise 2.4f). ie 


EXERCISES 
1. Verify the statements made in Example 5.2. 
2. Fill in the details of the proof of Proposition 5.3. 
3. Prove Proposition 5.6. 
. Verify the statements made in Example 5.9. 


4 

5. Verify the statements made in Example 5.10. 
6. Verify the statements made in Example 5.11. 
Z 


. With the notation of (5.10), show that if X is o-compact, then the dual of C(X) 
is the space of all extended F-valued measures. 


8. Is the inductive limit topology on C(X) (5.10) different from the topology of 
uniform convergence on compact subsets of X (1.5)? 


9. Prove Proposition 5.24. 
10. Verify the statements made in Example 5.20. 
For the remaining exercises, 2 is always an open subset of R“, d > 1. 
11. If » is a measure on 2, d > {¢$ dp is a distribution 2. 


12. Let f: 2 — F be a function with continuous partial derivatives and let L, be 
defined as in (5.20). Show that for every ¢ in 9(92) and1 <j < d, L,;(d¢/0x;) 
= —L,(¢), where g = df/dx,. (Hint: Use integration by parts.) 


13. Exercise 12 motivates the following definition. If L is a distribution on 2, 
define 0L/dx,;: (2) > F by dL/dx,;(>) = —L(d$/dx;) for all @ in D(&2). 
Show that dL/dx, is a distribution. 


14. Using Example 5.20 and Exercise 13, one is justified to talk of the derivative of 
any locally integrable function as a distribution. By Exercise 11 we can differen- 
tiate measures. Let f: R — R be the characteristic function of [0, 00) and show 
that its derivative as a distribution is 6,, the unit point mass at 0. [That is, 69 is 
the measure such that 6)(4) = 1if 0 © A and 6)(A) = 0 1f 0 € A|] 


15. Let f be an absolutely continuous function on R and show that (L,)’ = Ly. 


16. Let f be a left continuous nondecreasing function on R and show that (L,)’ is 
the distribution defined by the measure p such that pla, b) = f(b) — f(a) for 
all a < b. 


17. Let f be a C® function on 22 and let L be a distribution on Y(2). Show that 
M(¢) = L(¢f), ¢ in B(Q), is a distribution. State and prove a product rule for 
finding the derivative of M. 


CHAPTER V 


Weak ‘Topologies 


The principal objects of study in this chapter are the weak topology on a 
Banach space and the weak-star topology on its dual. In order to carry out 
this study efficiently, the first two sections are devoted to the study of the 
weak topology on a locally convex space. 


§1. Duality 


As in §IV.4, for a LCS 2, let 2@* denote the space of continuous linear 
functionals on 2. If x*, y*@2* and aeéF, then (ax* + y*)\(x)= 
ax*(x)+y*(x), x in &, defines an element ax* + y* in 4*. Thus 2* 
has a natural vector-space structure. 

It is convenient and, more importantly, helpful to introduce the notation 


(x, x") 
to stand for x*(x), for x in & and x* in %*. Also, because of a certain 
symmetry, we will use (x*, x) to stand for x*(x). Thus 


x*(x) = (x, x*) = (x*, x). 
We begin by recalling two definitions (IV.1.7 and IV.1.8). 
1.1. Definition. If & is a LCS, the weak topology on 2, denoted by “wk” 


or o(%, %*), is the topology defined by the family of seminorms { p,.: 
x* € &*\, where 


Pyx (x) = |{x,x*)]. 
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The weak-star topology on #*, denoted by “wk*” or o(2%*, &), is the 
topology defined by the seminorms { p,: x € %}, where 


p,(x*) = |¢x,x*)I- 
So a subset U of % is weakly open if and only if for every x, in U there is 
an e > Q and there are x*,..., x* in 2* such that 


n 
n 


() ie Sl e= eee = e} Cc U. 

k=1 
A net {x,} in % converges weakly to x, if and only if (x;,x*) > (xo, x*) 
for every x* in 4 *. (What are the analogous statements for the weak-star 
topology?) 

Note that both (%,wk) and (%*,wk*) are LCS’s. Also, 2 already 
possesses a topology so that wk is a second topology on 2. However, 2 * 
has no topology to begin with so that wk* is the only topology on 2*. Of 
course if 2 is a normed space, this last statement is not correct since %* is 
a Banach space (III.5.4). The reader should also be cautioned that some 
authors make no distinction between the weak and weak-star topologies. 
Finally, pay attention to the positions of 2 and %* in the notation 
o( 2, &*) = wk and o( &*, &) = wk*. 

If {x,} is a net in 2 and x,;—>0 in &, then for every x* in 2%, 
(x,,x*) > 0. So if 7 is the topology on 2, wk C 7 (A.2.9) and each x* 
in &* is weakly continuous. The first result gives the converse of this. 


1.2. Theorem. Jf 2 is a LCS, (%,wk)* = 2%. 


PROOF. Let f € (%,wk)*; that is, f is a wk-continuous linear functional on 
&. By (IV.3.1f) there are x*,x*,...,x* in 2* such that |f(x)| < 
ye-il(x, xF)>| for all x in &. This implies that MZ_,kerxf C kerf. By 
(A.1.4), there are scalars a,,...,a, such that f= L7_,a,xf; hence fe %*. 

a 


There is a similar result for wk*; the proof is left for the reader. 
1.3. Theorem. [f 2 is a LCS, (%*, wk*)* = 2. 


So # is the dual of a LCS—-(2*, wk*)—and hence has a weak-star 
topology—o((%, wk*)*, #*). As an exercise in notational juggling, note 
that o(( 2%, wk*)*, &*) = 0( Z, %*). 

All unmodified topological statements about 2% refer to its original 
topology. Soif A C & and we say that it is closed, we mean that A is closed 
in the original topology of %. To say that A is closed in the weak topology 
of & we say that A is weakly closed or wk-closed. Also cl A means the 
closure of A in the original topology while wk — cl A means the closure of 
A in the weak topology. The next result shows that under certain cir- 
cumstances this distinction is unnecessary. 
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1.4. Theorem. Jf 2% is a LCS and A is a convex subset of %, then 
cl A = wk — clA. 


PROOF. If 7 is the original topology of 2, then wk C 7, hence cl A C 
wk — cl A. Conversely, if x € 2\ cl A, then (IV.3.13) implies that there is 
an x* in &*, an a in R, and an e > O such that 


Re(a,x*) <a<ate< Re(x,x*) 


for all a in clA. Hence clACB={yeEZ: Recy,x*) < a}. But B is 
clearly wk-closed since x* is wk-continuous. Thus wk — cl A C B. Since 
x €B,x €wk—clA. a 


1.5. Corollary. A convex subset of & is closed if and only if it is weakly 
closed. 


There is a useful observation that can be made here. Because of (III.6.3) it 
can be shown that if 2 is a complex linear space, then the weak topology on 
& is the same as the weak topology it has if it is considered as a rea] linear 
space (Exercise 4). This will be used in the future. 


1.6. Definition. If 4 C 2%, the polar of A, denoted by A°, is the subset of 
4* defined by 


A° = {x* © #*: |(a,x*)| < 1 forall ain A}. 
If B C 2%, the prepolar of B, denoted by °B, is the subset of % defined by 
°B = {x © 4: |(x, b*)| < 1 for all b* in B}. 
If A C & the bipolar of A is the set °(A°). If there is no confusion, then it 


is also denoted by °A°. 


The prototype for this idea is that if A is the unit ball in a normed space, 
A° is the unit ball in the dual space. 


1.7. Proposition. Jf A C 2%, then 


(a) A° is convex and balanced. 

(b) If A, C A, then A° C AP. 

(c) Ifa €&F and a #0,(aA)° =a 'A°. 

(d) A C PAP, 

(e) A° = (24°)? 

PROOF. The proofs of parts (a) through (d) are left as an exercise. To prove 
(ec) note that A C °A° by (d), so (°A°)° C A® by (b). But A° C °(A°)° 
by an analog of (d) for prepolars. Also, °(A°)° = (°A°)°. Ss 
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There is an analogous result for prepolars. In fact, it is more than analogy 
that is at work here. By Theorem 1.3, (2 *, wk*)* = 2. Thus the result for 
prepolars is a consequence of the preceding proposition. 

If A is a linear manifold in 2 and x* € A®, then ta € A for all t > 0 
and a in A. So 1 = |¢ta, x*)| = t\(a,x*)|. Letting t — oo shows that 
A° = A+, where 


At= {x* in #*: (a,x*) = 0 forall ain A}. 
Similarly, if B is a linear manifold in #*, °B = ~B, where 
+B = {x in &: (x, b*) = 0 for all b* in B}. 
The next result is a slight generalization of Corollary [V.3.12. 


1.8. Theorem. Jf % is a LCS and A C &, then °A° is the closed convex 
balanced hull of A. 


ProorF. Let A, be the intersection of all closed convex balanced subsets of 2 
that contain A. It must be shown that A, =°A°. Since °A° is closed, 
convex, and balanced and A C °A°, it follows that A, C °A°. 

Now assume that x) © #\ A,. A, is a closed convex balanced set so by 
(1V.3.13) there is an x* in #*, an a in R, and an « > OQ such that 


Re(a,,x*) <a<ate< Re(xo, x*) 
for all a, in A,. Since 0 € A,, 0 = (0,x*) <a. By replacing x* with 
a~'x* it follows that there is an e > 0 (not the same as the first €) such that 
Re(a,,x*) <1 <1l+e< Re(xy, x*) 
for all a, in A,. If a, € A, and (a,, x*) = |(a,, x*) Je", then ea, € A, 
and so 
aia = Rete “ax * SL = ReCrp0 x) 


for all a, in A,. Hence x* € Af, and x, € °A®. That is, 2\ A, C Z\ °A®. 
i 


1.9. Corollary. If Z is a LCS and B Cc &*, then (°B)° is the wk* closed 
convex balanced hull of B. 


Using the weak and weak* topologies and the concept of a bounded 
subset of a LCS (IV.2.5), it is possible to rephrase the results associated with 
the Principle of Uniform Boundedness (§III.14). As an example we offer the 
following reformulation of Corollary III.14.5 (which is, in fact, the most 
general form of the result). 


1.10. Theorem. Jf 2 is a Banach space, Y is a normed space, and 
AC Bi X,Y) such that for every x in X, {Ax: A € WL} is weakly bounded 
in Y, then & is norm bounded in B(2,Y). 
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EXERCISES 


1. Show that wk is the smallest topology on 2% such that each x* in 2* is 
continuous. 


2. Show that wk* is the smallest topology on 2* such that for each x in 2%, 
x* > (x, x™*) is continuous. 


3. Prove Theorem 1.3. 


4. Let be a complex LCS and let 2 denote the collection of all continuous 
real linear functionals on 2. Use the elements of 24 to define seminorms on # 
and let o(%,%) be the corresponding topology. Show that o(%,%*) = 
0(%,XR ). 


5. Prove the remainder of Proposition 1.7. 
6. If A C %, show that A is weakly bounded if and only if A° is absorbing in 2%. 


7. Let & be a normed space and let { x,,} be a sequence in & such that x, > x 
weakly. Show that there is a sequence { y,} such that y, © co{x,,Xx>,...,x,} 
and ||y, — x|| — 0. (Hint: use Theorem 1.4.) 


8. If # is a Hilbert space and {h,,} is a sequence in # such that h, — h weakly 
and ||h,|| — ||A||, then ||4, — || > 0. (The same type of result is true for 
L?’-spaces if 1 < p < oo.) 


9. If & is a normed space show that the norm on % is lower semicontinuous for 
the weak topology and the norm on %* is lower semicontinuous for the 
weak-star topology. 


10. Suppose % is an infinite-dimensional normed space. If S = {x © @: ||x]|| = 1}, 
then the weak closure of S is { x: ||x|| < 1}. 


§2. The Dual of a Subspace and a Quotient Space 


In §I1II.4 the quotient of a normed space 2% by a closed subspace .4 was 
defined and in (III.10.2) it was shown that the dual of a quotient space 2/7 
is isometrically isomorphic to ~*~. These results are generalized in this 
section to the setting of a LCS and, moreover, it is shown that when 
(2%/M)* and .@* are identified, the weak-star topology on (%/7Z)* is 
precisely the relative weak-star topology that .@~ receives as a subspace of 
a. 
The first result was presented in abbreviated form as Exercise IV.1.16. 


2.1. Proposition. Let & be a LCS and let F be a family of seminorms 
defining the topology of &. If MH < & and p © F, define p: X/M — (0, oc) 
by 


p(x + @) = inf{ p(x+y): yE}-. 
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Then p is a seminorm on 2/M, and the topology on 2/M defined by 
P= {p: p © FP} is the quotient space topology. 


PROOF. Exercise. 


Thus if 2 isa LCS and 4< &, then 4/7 isa LCS. Let fe (2/7@)*. 
If O: > #/M is the natural map, then f>Q € 2*. Moreover, foQ € 
M~ . Hence f f°Q isa map of (2/7)* ~> M* Cc #*. 


2.2. Theorem. Jf 2 is a LCS, @< 2%, and O: > 2/M is the natural 
map, then f > f °Q defines a linear bijection between (2/M)* and M~. If 
(2/M)* has its weak-star topology o((2/M)*,%/M) and M~ has the 
relative weak-star topology o(&*, 2)|M~ , then this bijection is a homeomor- 
phism. If & is a normed space, then this bijection is an isometry. 


PROOF. Let p: (%/M)* > M* be defined by p(f) = f° Q. It was shown 
prior to the statement of the theorem that p is well defined and maps 
(2/M)* into M°~ . It is easy to see that p is linear and if 0 = p(f) =f °Q, 
then f = 0 since Q is surjective. So p is injective. Now let x* €.@~ and 
define f: 2/4 F by f(x +.Z@) = (x, x*). Because 4 C kerx*, f is 
well defined and linear. Also, Q°-'{x +: |f(x+.@)| <l} ={x Ee: 
|(x, x*)| < 1} and this is open in % since x* is continuous. Thus {x +4: 
f(x + M@)| < 1} is openin 2/4 and so f is continuous. Clearly p( f) = x*, 
SO p 1S a bijection. 

If 2 is a normed space, it was shown in (III.10.2) that p is an isometry. It 
remains to show that p is a weak-star homeomorphism. Let wk* = 
o(%*, #) and let o* = o((2/74)*, F/M). lf { f,} is a net in (2/7)* and 
f, ~ 0(o*), then for each x in 2, (x, p(f,)) = f,(O(x)) > 0. Hence p(/;) 
— 0 (wk*). Conversely, if p(f;) ~ O(wk*), then for each x in 2, f(x + 
M) = (x, p(f;)) > 0; hence f; > 0(0*). a 


Once again let 4< @. If x* © #*, then the restriction of x* to Z, 
x*|.4@, belongs to @*. Also, the Hahn—Banach Theorem implies that the 
map x* > x*|.@ is surjective. If p(x*) = x*|.Z@, then p: 2* > .@* is 
clearly linear as well as surjective. It fails, however, to be injective. How 
does it fail? It’s easy to see that kerp =.@*. Thus p induces a linear 
bijection p: &*/4* > M*. 


2.3. Theorem. If % isa LCS, 4 < %, and p: 2* — M® is the restriction 
map, then p induces a linear bijection p: &*/M* > M*. If &*/M~ has 
the quotient topology induced by o(2*,2) and M* has its weak-star 
topology 0(. 4*, M), then p is a homeomorphism. If & is a normed space, 
then p is an isometry. 


PrRooF. The fact that p is an isometry when @% is a normed space was 
shown in (III.10.1). Let wk* = o(.4@*,.@) and let 7* be the quotient 
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topology on &*/M4* defined by o(2*, #). Let 0: ¥* > X*/M be the 
natural map. Therefore the diagram 


a ° M* 


ON A 3 
K* /M* 


commutes. If y © 4, then the commutativity of the diagram implies that 
QBN yt ©M*: |(y, y*)| <1}) = QT x* +H: [Cy x*)| < 1) 
SA kPS A psx) 1h, 


which is weak-star open in #*. Hence p: (%*/@* ,n*) > (.4*, wk*) is 
continuous. 

How is the topology on #*/4@~ defined? If x € %, p,(x*) = |(x, x*)| 
is a typical seminorm on % *. By Proposition 2.1, the topology on 2*/Z4~* 
is defined by the seminorms { p,: x © 2}, where 


D,(x* + M) = inf{|(x,x* + z*)|: z* EZ}. 
2.4. Claim. If x €.4Z, then p, = 0. 


In fact, let P= {ax: ae}. If x E24, then LAW = (0). Since 
dim 2 < oo, @ 1s topologically complemented in 2+-.4. Let x* € &* 
and define f: 2+ M4 - F by f(ax + y) = (y, x*) for y in 4 and a inF. 
Because -@ is topologically complemented in 2+ 4, if a,x + y, > 0, then 
y,; 2 0. Hence f(a;x + y,) = (y,,x*) 2 0. Thus f is continuous. By the 
Hahn-—Banach Theorem, there is an x* in 2* that extends f. Note that 
x* ~x* © MW. Thus p.(x* + 4°) = p.(x¥ + 4°) < p,(x*) = 
|x, xf)>| = 0. This proves (2.4). 

Now suppose that {x* +.@~} isa net in 2*/@* such that p(x* + 
M*) = x*|\M — 0 (wk*) in @*. If x €% and x €.4, then Claim (2.4) 
implies that p,(x* +.@~*)=0. If x © 4, then p,(x* +. M*) < |<x, x*)| 
— 0. Thus x* +.4~* > 0(7*) and 6 is a weak-star homeomorphism. a" 


EXERCISES 


1. In relation to Claim 2.4, show that if ?< 2%, dim2&< o, and “4 < &, then 
&+ M is closed. 


2. Show that if “4 < % and @ is topologically complemented in %, then 4~ is 
topologically complemented in 2%* and that its complement is weak-star and 
linearly homeomorphic to #* 74> . 
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§3. Alaoglu’s Theorem 


If Z is any normed space, let’s agree to denote by ball % the closed unit 
ball in &. So ball Z= {x © @&: |x|] < 1}. 


3.1. Alaoglu’s Theorem. Jf Z is a normed space, then ball 2 * is weak-star 
compact. 


PRooF. For each x in ball Z, let D. = {a © F: Ja] < 1} and put D= 
Il{ D,: x © ball2}. By Tychonoffs Theorem, D is compact. Define 
7. ball ¥* — D by 


T(x*)(x) = (x, x*). 


That is, 7T(x*) is the element of the product space D whose x coordinate 1s 
(x, x*). It will be shown that 7 is a homeomorphism from (ball 2 *, wk*) 
onto r(ball % *) with the relative topology from D, and that r(ball 2 *) is 
closed in D. Thus it will follow that 7r(ball %*), and hence ball 2 *, is 
compact. 

To see that 7 is injective, suppose that r(x*) = 7(x*). Then for each x 
in ball 2, (x, x*) = (x, x}). It follows by definition that x* = xf. 

Now let {x*} be a net in ball 2* such that x* — x*. Then for each x in 
ball 2, r(x*)(x) = (x, x*) > (x, x*) = 1(x*)(x). That is, each coordi- 
nate of {7r(x*)} converges to t(x*). Hence 1(x}*) > t(x*) and 7 is 
continuous. 

Let x* be a net in ball 2*, let f © D, and suppose 7(x7*) > f in D. So 
f(x) = lim(x, x*) exists for every x in ball 2. If x © 2, let a > 0 such 
that |lax|| <1. Then define f(x) =a 'f(ax). If also B > 0 such that 
Bx|| <1, then a 'f(ax) = a 'lim(ax, x*) = B-'lim( Bx, x*) = 
B-+f( Bx). So f(x) is well defined. It is left as an exercise for the reader to 
show that f: 2-— F is a linear functional. Also, if ||x|| < 1, f(x) © D, so 
if(x)| < 1. Thus f= x* € ball @* and r(x*)=/f. Thus r(ball2%*) is 
closed in D. This implies that 7(ball 2*) is compact and, hence, 7 is a 
homeomorphism (A.2.8). = 


EXERCISES 
1. Show that the functional f occurring in the proof of Alaoglu’s Theorem is linear. 


2. Let & be a LCS and let V be an open neighborhood of 0. Show that V° is 
weak-star compact in 2%. 


3. If & is a Banach space, show that there is a compact space X such that % 1s 
isometrically isomorphic to a closed subspace of C(X). 
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§4. Reflexivity Revisited 


In §IJI.11 a Banach space 2 was defined to be reflexive if the natural 
embedding of % into its double dual, 2 **, is surjective. Recall that if 
x © #, then the image of x in 2**, X, is defined by (using our recent 
notation) 


(x#, 2) = (x, 2%) 
for all x* in 2*. Also recall that the map x > X is an isometry. 

To begin, note that 2**, being the dual space of 2 *, has its weak-star 
topology o( % **, *). Also note that if 2 is considered as a subspace of 
4** then the topology o(%**, 2*) when relativized to % is o( %, #*), 
the weak topology on 2. This will be important later when it is combined 


with Alaoglu’s Theorem applied to 2** in the discussion of reflexivity. But 
now the next result must occupy us. 


4.1. Proposition. Jf Z is a normed space, then ball Z is o( 2 **, & *) dense 
in ball 2 **., 


PrRooF. Let B = the o( 2% **, &*) closure of ball 2 in %**; clearly, BC 


ball @**. If there is an x#* in ball 2**\ B, then the Hahn—Banach 
Theorem implies there is an x* in 2*, an a in R, and an e > 0 such that 


Re(x,x*) <a<ate< Re(x*, x%*) 


for all x in ball 2. (Exactly how does the Hahn—Banach Theorem imply 
this?) Since 0 € ball 2, 0 < a. Dividing by a and replacing x* by a !x*, it 
may be assumed that there is an x* in 2* and an e > 0 such that 


Re(x,x*) <1 < 1+ = Re(x*, xt*) 


for all x in ball &. Since e'’x © ball ¥ whenever x € ball %, this implies 
that |(x,x*)| <1 if ||x|| < 1. Hence x* © ball %*. But then 1+ < 
Recx*, xg*) < |(x*, x¢*)| < ||x§*|| < 1, a contradiction. a 


4.2. Theorem. Jf % is a Banach space, the following statements are equiv- 
alent. 


(a) Z is reflexive. 

(b) &* is reflexive. 

(c) of 2 *, 2%) = 0( 4*, # **),. 
(d) ball & is weakly compact. 


PROOF. (a) = (c): This is clear since 2 = % **. 

(d) = (a): Note that o( %**, %*)|%= o( %, &*). By (d), ballZ% is 
o(2**, #*) closed in ball 2**. But the preceding proposition implies 
ball Z is o(  **, &*) dense in ball 2 **. Hence ball 2 = ball #** and so & 
is reflexive. 
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(c) = (b): By Alaoglu’s Theorem, ball %* is o( 2% *, 2 )-compact. By (c), 
ball &* is o( *,  **) compact. Since it has already been shown that (d) 
implies (a), this implies that 2* 1s reflexive. 

(b) = (a): Now ball2% is norm closed in 2**; hence ball% is 
o( & **, #***) closed in 4** (Corollary 1.5). Since #* = 4*** by (b), 
this says that ball Z is o( % **, *) closed in 2% **. But, according to (4.1), 
ball J is o( **, X*) dense in ball 2 **. Hence ball 2 = ball 2** and 
is reflexive. 

(a) = (d): By Alaoglu’s Theorem, ball 2** is o( 2 **,2%*) compact. 
Since # = & **, this says that ball % is o( 2%, 2 *) compact. i 


4.3. Corollary. If 2 is a reflexive Banach space and <2, then Ml is 
reflexive. 


Proor. Note that ball. 4 =.@0N [ball %], so ball _Z is o( 2%, 4 **) com- 
pact. It remains to show that o(%, *)|.Z = o( 4, 4*). But this follows 
by (2.3). (How?) ie 


Call a sequence {x, } in & a weakly Cauchy sequence if for every x* in 
#*, {(x,, x*)} is a Cauchy sequence in F. 


4.4. Corollary. If % is reflexive, then every weakly Cauchy sequence in & 
converges weakly. That is, & is weakly sequentially complete. 


Proor. Since {{x,,x*)} is a Cauchy sequence in F for each x* in 2%, 
{x,} is weakly bounded. By the PUB there is a constant M such that 
\|x,,|| < M for all n > 1. But {x © %: ||x|| < M} is weakly compact since 2 
is reflexive. Thus there is an x in 2 such that x, —qj> x weakly. But for 
each x* in #*, lim(x,, x*) exists. Hence (x,,x*) > (x, x*), SO x, > xX 
weakly. z 


Not all Banach spaces are weakly sequentially complete. 


4.5. Example. C[0,1] is not weakly sequentially complete. In fact, let 
f(t) = (0 — ntyif0 <¢t < 1/nand f,(t) = Oif1/n <7t< 1.Ifp © MO, 1], 
then {f,du — p({0}) by the Monotone Convergence Theorem. Hence { /, } 
is a weakly Cauchy sequence. However, { f,,} does not converge weakly to 
any continuous function on [0, 1]. 


4.6. Corollary. If % is reflexive, H< %, and x, € &\M, then there is a 
point yo in Ml such that ||x9 — Yol|| = dist(xg, Z). 

PROOF. x > ||x — Xo|| is weakly lower semicontinuous (Exercise 1.9). If 
d= dist(x,, 4), then 4M {x: ||x — Xl] < 2d} is weakly compact and a 
lower semicontinuous function attains its minimum on a compact set. Bi 
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It is not generally true that the distance from a point to a linear subspace 
is attained. If 4 C %, call  proximinal if for every x in % there isa y in 
M such that ||x — y|| = dist(x, #). So if & 1s reflexive, Corollary 4.6 
implies that every closed linear subspace of 2 is proximinal. If 2 is any 
Banach space and -@ is a finite-dimensional subspace, then it is easy to see 
that -@ is proximinal. How about if dim(%/Z) < 00? 


4.7. Lemma. If & is a Banach space and x* © #*, then kerx* is proxim- 
inal if and only if there is an x in &, ||x|| = 1, such that (x,x*) = ||x*|{. 


PROOF. Let 4 = kerx* and suppose that -@ is proximinal. If f: 2/7 > F 
is defined by f(x +.@) = (x,x*), then f is a linear functional and 
AI] = |)x*||. Since dim 2/@ = 1, there is an x in Z such that ||x +.4|| = 1 
and f(x + @) = ||f||. Because 4 is proximinal, there is a y in 4 such 
that 1 = ||x + @|| = ||x + yl|. Thus (x + y,x*) = (x,x*) =f(x + 4) 
= IWfll = lx". 

Now assume that there is an x, in 2 such that ||x || = 1 and (x), x*) = 
|x*|]. If x © & and ||[x +.4|| =a > 0, then |la~'x +.4]| = 1. But also 
Xo + 4@|| = 1. (Why?) Since dim 2/Z = 1, there is a B in F, |B] = 1, such 
that a ‘x + #= B(x, +). Hence a~'x — Bx, €.4, or, equivalently, 
x — aBx, € M4. However, ||x — (x — aBx,)|| = |laBxo|| = a = dist(x, Z). 
So the distance from x to 4 is attained at x — aBx». a 


4.8. Example. If L: C[0,1] — F is defined by 
172 1 
L(f)= [F(x de — fo f(x) dx, 
0 1/2 


then ker Z is not proximinal. 


There is a result in James [1964b] that states that a Banach space is 
reflexive if and only if every closed hyperplane is proximinal. This result is 
very deep. 


EXERCISES 
1. Show that if & is reflexive and 4 < %, then %/@ is reflexive. 


2. If is a Banach space, 4 < %, and both 4 and 2/@ are reflexive, must Z be 
reflexive? 


3. If X is compact, show that C(X) is reflexive if and only if X is finite. 


4. If (X,Q,) is a o-finite measure space, show that L'( X, Q, 1) is reflexive if and 
only if it is finite dimensional. 


5. Give the details of the proofs of the statements made in Example 4.5. 


6. Verify the statement made in Example 4.8. 
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7. If (X, Q, 1) is a o-finite measure space, show that L*() is weakly sequentially 
complete but is reflexive if and only if it is finite dimensional. 


8. Let X be compact and suppose there is a norm on C(_X) that is given by an inner 
product making C(X) into a Hilbert space such that for every x in YX the 
functional f+ f(x) on C(X) is continuous with respect to the Hilbert space 
norm. Show that X is finite. 


§5. Separability and Metrizability 


The weak and weak-star topologies on an infinite-dimensional Banach space 
are never metrizable. It is possible, however, to show that under certain 
conditions these topologies are metrizable when restricted to bounded sets. 
In applications this is often sufficient. 


5.1. Theorem. Jf % is a Banach space, then ball 2* is weak-star metriz- 
able if and only if & is separable. 


Proor. Assume that 2% is separable and let {x,} be a countable dense 
subset of ball #. For each n let D, = {a € F: Ja| < 1}. Put X = [1°_,D,; 
X iS a compact metric space. So if (ball 2 *,wk*) is homeomorphic to a 
subset of X, ball ¥* is weak-star metrizable. 

Define 7+: ball 2* > X by 1(x*) = {(x,,x*)}. If {x*} is a net in 
ball &* and x* — x* (wk*), then for each n > 1, (x,, x7) > (x,,x*); 
hence t(x*) > 7(x*) and 7 is continuous. If 7(x*) = T(y*), (x,,x* - 
y*) =0 for all n. Since {x,,} is dense, x* — y* = 0. Thus 7 is injective. 
Since ball 2* is wk* compact, 7 is a homeomorphism onto its image 
(A.2.8) and ball 2* is wk* metrizable. 

Now assume that (ball 2 *, wk*) is metrizable. Thus there are open sets 
{U,: n= 1} in (ball *, wk*) such that 0 © U, and /%_,U, = (0). By the 
definition of the relative weak-star topology on ball 2 *, for each n there is 
a finite set F, contained in 2 such that {x* © ball 2*: |(x,x*)| <1 for 
all x in F,} Cc U,. Let F = U%_, F;; so F is countable. Also, “(fF ~) is the 
closed linear span of F and this subspace of 2% is separable. But if 
x* © F~, then for each n > 1 and for each x in F,, |(x, x*/||x*||)| = 0 
<1. Hence x*/||x*|| © U, for all n > 1; thus x* = 0. Since F~ = (0), 
*(F +) = 4% and & must be separable. Ie 


Is there a corresponding result for the weak topology? If 2* 1s sep- 
arable, then the weak topology on ball % is metrizable. In fact, this follows 
from Theorem 5.1 if the embedding of @ into 2** is considered. This 
result is not very useful since there are few examples of Banach spaces 2 
such that 2* is separable. Of course if 2 is separable and reflexive, then 
%&* is separable (Exercise 3), but in this case the weak topology on % is the 
same as its weak-star topology when % is identified with 2 **. Thus (5.1) is 
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adequate for a discussion of the weak topology on the unit ball of a 
separable reflexive space. If 2= cy, then 2* = /' and this is separable but 
not reflexive. This is one of the few nonreflexive spaces with a separable 
dual space. 

If 2 is separable, is (ball 2%, wk) metrizable? The answer is no, as the 
following result of Schur demonstrates. 


5.2. Proposition. If a sequence in |' converges weakly, it converges in norm. 


ProorF. Recall that /° = (/')*. Since /' is separable, Theorem 5.1 implies 
that ball/* is wk* metrizable. By Alaoglu’s Theorem, ball/” is wk* 
compact. Hence (ball/*,wk*) is a complete metric space and the Baire 
Category Theorem is applicable. 

Let { f,} be a sequence of elements in /' such that f, > 0 weakly and let 
e > Q. For each positive integer m let 


F,, = { € ball/™: |(f,,6)| < e/3 forn > m}. 


It is easy to see that F,, is wk* closed in ball /© and, because f, — 0 (wk), 
Un" iF, = ball/*. By the theorem of Baire, there is an F, with nonempty 
weak interior. 

An equivalent metric on (ball /”, wk) is given by 


a(o.¥)= 2 2“le(j) - ¥(y)I 
j=l 
(see Exercise 4). Since F,, has a nonempty weak interior, there is a ¢ in F,, 
and a 6 > 0 such that {y © ball/”: d(¢,~) < 6} CF. Let J >1 such 
that 2°/~" < 6. Fix n>m and define y in I” by ¥(j) = ¢(/) for 
1 <j<Jand ¥(j) = sign(f,(j)) for j > J. Thus WA) f,C/) = [CAI for 
j > J. It is easy to see that y € ball/™. Also, d(¢, Y) = +2 716(J) - 
¥(/)| <2-2-% =2°4"-) < 8. Soy &€ F, and hence |(y, f,,)| < ¢/3. Thus 


Leas + LY LG) 


j=] J=HI+1 


E 
5.3 < 3 


for n = m. But there is an m, > m such that for n > m,, D7_,\f,(/)| < €/3. 
(Why?) Combining this with (5.3) gives that 


Ih = DU 
<$4] DL wU+ Los |+] Eos 
eee Du 


whenever n > my. PJ 
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So if (ball /', wk) were metrizable, the preceding proposition would say 
that the weak and norm topologies on /' agree. But this is not the case 
(Exercise 1.10). 

Also, note that the preceding result demonstrates in a dramatic way that 
in discussions concerning the weak topology it is essential to consider nets 
and not just sequences. 

A proof of (5.2) that avoids the Baire Category Theorem can be found in 
Banach [1955], p. 218. 


EXERCISES 


1. Let B = ball M[0,1] and for »,v in M[0,1] define d(p,v) = L%_ 2° "fox" du 
— fix" dv|. Show that d is a metric on M[0,1] that defines the weak-star 
topology on B but not on M(0, 1]. 


2. Let X be a compact space and let Y= {(U,V): U,V are open subsets of X and 
clU C V}. For u = (U,V) in Y, let f,: X — [0,1] be a continuous function such 
that f, = 1 on clU and f,=0 on X\ V. Show: (a) the linear span of { f,: 
u © @\ is dense in C(X); (b) if X is a metric space, then C(X) is separable. 


3. If & is a Banach space and %* is separable, show that (a) % is separable; (b) if 
K is a weakly compact subset of 2, then K with the relative weak topology is 
metrizable. 


4. If B = ball/™, show that d(¢,) = L%_,2 /|¢(/) — ¥(J)| defines a metric on 
B and that this metric defines the weak-star topology on B. 


§6*. An Application: The Stone—Cech 
Compactification 


Let X be any topological space and consider the Banach space C,(X). 
Unless some assumption is made regarding X, it may be that C,(X%) is 
“very small.” If, for example, it is assumed that X is completely regular, 
then C,(X) has many elements. The next result says that this assumption is 
also necessary in order for C,(X) to be “large.” But first, here is some 
notation. 

If x © X, let 6,: C,(X) > F be defined by 6,(/) = f(x) for every f in 
C,(X). It is easy to see that 6, © C,(X)* and |/6,|| = 1. Let 4: X > 
C,(X)* be defined by A(x) = 6,. If {x,;} is a net in X and x, > x, then 
f(x;) > f(x) for every f in C,(X). This says that 6, > 6, (wk*) in 
C,(X)*. Hence A: X — (C,(X)*, wk*) is continuous. Is 4 a homeomor- 
phism of X onto A(X)? 


6.1. Proposition. The map A: X — (A(X), wk*) is a homeomorphism if and 
only if X is completely regular. 
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PRooF. Assume X is completely regular. If x, # x5, then there is an f in 
C,(X) such that f(x,) = 1 and f(x,) = 0; thus 6, (f) # 6,,(f). Hence A 
is injective. To show that A: X — (A(X), wk*) is an open map, let U be an 
open subset of X and let x, © U. Since X is completely regular, there is an 
f in C,(X) such that f(x,)= 1 and f=0 on X\U. Let Vi = {pe 
C,(X)*: (f,w) > O}. Then V, is wk* open in C,(X)* and V, 0 A(X) = 
{6,: f(x) > 0}. So if V=V,NACX), V is wk* open in A(X) and 
6. © VC ACU). Since xy was aebitiary, A(U) is open in A(X). Therefore 
A: X — (A(X), wk*) is a homeomorphism. 

Now assume that A is a homeomorphism onto its image. Since 
(ball C,( X)*, wk*) is a compact space, it is completely regular. Since 
A(X) ¢ ball C,( X)*, A(X) is completely regular (Exercise 2). Thus X is 
completely regular. g 


6.2. Stone—Cech Compactification. If X is completely regular, then there is a 
compact space BX such that: 


(a) there is a continuous map A: X — BX with the property that A: X > A(X) 
is a homeomorphism; 

(b) A(X) is dense in BX; 

(c) if f € C,(X), then there is a continuous map f*: BX > F such that 
fPoha 7 


Moreover, if §2 is a compact space having these properties, then Q is 
homeomorphic to BX. 


PrRoor. Let 4: X > C,(X)* be the map defined by A(x) = 6, and let 
BX = the weak-star closure of A(X) in C,(X)*. By Alaoglu’s Theorem and 
the fact that ||6,|| = 1 for all x, BX is compact. By the preceding proposi- 
tion, (a) holds. Part (b) is true by definition. It remains to show (c). 

Fix f in C,(X) and define f?: BX > F by f8(7) = (f,7) for every 7 in 
BX. [Remember that BX C C,(X)*, so this makes sense.] Clearly f* is 
continuous and f%° A(X) = f9(6,) = (f,8,) = f(x). So ff oA = f and (c) 
holds. 

To show that BX is unique, assume that Q is a compact space and 7: 
X — (2 is a continuous map such that: 


(a’) aw: X > m(X) 1s a homeomorphism; 
(b’) z(X) is dense in QQ; ; ; 
(c’) if f © C,(X), there is an f in C(Q2) such that fom = f. 


Define g: A(X) > Q by g(A(x)) = 2(x). In other words, g = 7° A™!. 
The idea is to extend g to a homeomorphism of BX onto 22. If 7 € BX, 
then (b) implies that there is a net {x,;} in X such that A(x,) > 7 in BX. 
Now {7(x,)} 1s a net in 2 and since 2 is compact, there is an w, in 2 such 
that 7(x;) -q> Wy. If F © C(Q), let f = Fo; so f © C,(X) (and F = f). 
Also, f(x;) = (f,6,.) > (f,%) = f?(1%). But it is also true that f(x,) = 
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F((x,)) ~j> F(w,). Hence F(w)) = f?(1)) for any F in C(Q). This 
implies that w, is the unique cluster point of {7(x,)}; thus 7(x,;) > w, 
(A.2.7). Let g(1)) = Wy. It must be shown that the definition of g(1,) does 
not depend on the net {x,} in X such that A(x;) > 7). This is left as an 
exercise. To summarize, it has been shown that 


6.3 There is a function g: BX > Q 
such that if f € C,(X), then f? = fog. 


To show that g: BX — Q 1s continuous, let {7,} be a net in BX such that 
7, > 7. If F @ C(Q), let f= Foam; so fe C,(X) and f = F. Also, f*(7,) 
— f8(r). But F(g(7,)) = f8(7,) > f8(7) = F(g(7)). It follows (6.1) that 
g(7,) > g(7) in Q. Thus g is continuous. 

It is left as an exercise for the reader to show that g is injective. Since 
2(BX) D g(ACX)) = 7(X), g(BX) is dense in 2. But g( BX) is compact, 
so g is bijective. By (A.2.8), g is a homeomorphism. a 


The compact set BX obtained in the preceding theorem is called the 
Stone-—Cech compactification of X. By properties (a) and (b), X can be 
considered as a dense subset of BX and the map A can be taken to be the 
inclusion map. With this convention, (c) can be interpreted as saying that 
every bounded continuous function on X has a continuous extension to BX. 

The space BX is usually very much larger than X. In particular, it is 
almost never true that BX is the one-point compactification of X. For 
example, if X = (0,1], then the one-point compactification of X is [0, 1]. 
However, sin(1/x) € C,(X) but it has no continuous extension to [0, 1], so 
BX # [0,1]. 

To obtain an idea of how large BX \ X is, see Exercise 6, which indicates 
how to show that if N has the discrete topology, then BN \N has 2% 
pairwise disjoint open sets. The best source of information on the Stone-Cech 
compactification is the book by Gillman and Jerison [1960], though the 
approach to BX is somewhat different there than here. 


6.4. Corollary. If X is completely regular and p © M(BX), define L,: 
C,(X) > F by 


L,(f) =f fed 


for each f in C,(X). Then the map 4 > L, is an isometric isomorphism of 
M(BX) onto C,(X)*. 


Proor. Define V: C,(X) > C(BX) by Vf = f?. It is easy to see that V is 
linear. Considering X as a subset of BX, the fact that BX = cl X implies 
that V is an isometry. If g € C(BX) and f = g|X, then g = f* = Vf; hence 
V is surjective. 
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If pe M( BX) = C(BX)*, it is easy to check that L,€ C,(X)* and 
IZ, I] = |u| since V is an isometry. Conversely, if L © C,(X)*, then 
LoV~' © C(BX)* and ||LoV~'|| = ||L||. Hence there is a » in M(BX) 
such that {gdu = LeV~'(g) for every g in C(BX). Since V~1g = g|X, it 
follows that L = L,. fe] 


The next result is from topology. It may be known to the reader, but it is 
presented here for the convenience of those to whom it is not. 


6.5. Partition of Unity. Jf X is normal and {U,,...,U,} is an open covering 
of X, then there are continuous functions f,,..., f,, from X into [0,1] such that 


(a) Leif, (x) = 1; 
(b) f,(x) = 0 for xin X\ U, andl <k <n. 


PRooF. First observe that it may be assumed that {U,,...,U,} has no 
proper subcover. The proof now proceeds by induction. 

If n = 1, let f, = 1. Suppose n = 2. Then X\ U, and X\ U, are disjoint 
closed subsets of X. By Urysohn’s Lemma there is a continuous function /;: 
X — [0,1] such that f,(x) = 0 for x in X\ U, and f,(x)=1 for x in 
X\ U,. Let f; = 1 — f, and the proof of this case is complete. 

Now suppose the theorem has been proved for some n>2 and 
{U,,...,U,,,} is an open cover of X that is minimal. Let F = X\ U,,,; 
then F is closed, nonempty, and F C UZ_,U,. Let V be an open subset of 
X such that FC VC clV CUZ_\U,. Since clV is normal and {U, 
clV,...,U, 1 clV} is an open cover of clV, the induction hypothesis 
implies that there are continuous functions g,,...,g, on clV such that 
qt =18% = 1 and for l<k<n, 0<g, <1, and g,(clV\ U,)=0. By 
Tietze’s Extension Theorem there are continuous functions §,,..., £, on X 
such that g, = g, onclV andO < &, <1lforl<k<n. 

Also, there is a continuous function h: X — [0,1] such that h = 0 on 
X\ V and h=1 on F. Put f, = g,h for 1 <k <n and let f,,, =1- 
rinis,- Clearly O<f, <1lifl<k<n. If x €clV, then f,, (x)=1- 
(oi -18,(x)) h(x) = 1 — A(x); so 0 <f,44(4) <1 on dV. If x EX\V, 
then f,,,(x) = 1 since h(x) = 0. Hence0 <f,,, < 1. 

Clearly (a) holds. Let 1 < k <n; if x © X\ U,, then either x € (cIV)\ 
U,or x © (X\clV)\ U,. If the first alternative is the case, then g,(x) = 0, 
so f,(x) = 0. If the second alternative is true, then h(x) =0 so that 
f(x) =0. If x © X\U,,, =F, then hA(x)=1 and so f,,,(x)=1- 
Lani 84 (%) = 0. a 


Partitions of unity are a standard way to put together local results to 
obtain global results. If { f,} is related to {U,} as in the statement of (6.5), 
then { f,} is said to be a partition of unity subordinate to the cover {U,}. 
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6.6. Theorem. Jf X is completely regular, then C,(X) is separable if and 
only if X is a compact metric space. 


PROOF. Suppose X is a compact metric space with metric d. For each a, let 
{U<™: 1 < k < N,} be an open cover of X by balls of radius 1/n. Let 
{ f£?: 1 < k < N,} bea partition of unity subordinate to {U{?: 1 < k < 
N,,}. Let Y be the rational (or complex-rational) linear span of { f{””: 
n>1,1<k<N,}; thus ¥ is countable. It will be shown that Y is dense 
in C(X). 

Fix f in C(X) and «> 0. Since f is uniformly continuous there is a 
5 >0 such that |f(x,) —f(x,)| < ¢€/2 whenever d(x,, x,) < 6. Choose 
n> 2/85 and consider the cover {Ui\”: 1< k <N,}. If x1, x, € US”, 
d(x,,X,) < 2/n < 8; hence [f(x,) — f(x,)| < €/2. Pick x, in U{”) and let 
a, € @ + iQ such that |a, — f(x,)| < ¢/2. Let g=L,a,f/", so g € Y. 
Therefore for every x in X, 


f(x) — g(x) = LS (x)fr(*) nS) 
< Lis(s) ~ Ol fe" (%). 


Examine each of these summands. If x € US”, then |f(x) — a,| < |f(x) - 
f(x,)| + If(x,) - a,| < e If x € UL, then f£(x) = 0. Hence |f(x) — 
2(x)| < LD eff?(x) = «. Thus ||f — g|| < « and Y is dense in C(X). This 
shows that C(X) is separable. 

Now assume that C,(X) is separable. Thus (ball C,( X)*, wk*) is metriz- 
able (5.1). Since X is homeomorphic to a subset of ball C,(X)* (6.1), X is 
metrizable. It also follows that BX is metrizable. It must be shown that 
X = BX. 

Suppose there is a tr in BX\ X. Let {x,,} be a sequence in X such that 
x, 2 7. It can be assumed that x, # x,, for n#m. Let A = {x,: n is 
even} and B = {x,: n is odd}. Then A and B are disjoint closed subsets of 
X (not closed in BX, but in X) since A and B contain all of their limit 
points in X. Since X is normal, there is a continuous function f: X — [0, 1] 
such that f= 0 on A and f=1 on B. But then f*(7) = lim f(x,,) = 0 
and f(r) = lim f(x,,,,) = 1, a contradiction. Thus BX¥\ X¥=0. & 


EXERCISES 
1. If x © X and 8,(f) = f(x) for all f in C,(X), show that |/6,|| = 1. 


NO 


. Prove that a subset of a completely regular space is completely regular. 


3. Fill in the details of the proof of Theorem 6.2. 


> 


. If X is completely regular, (2 is compact, and f: X — {2 is continuous, show that 
there is a continuous map f*: BX > Q such that f*|X = f. 


5. If X is completely regular, show that X is open in BX if and only if X 1s locally 
compact. 
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6. Let N have the discrete topology. Let {7,: n © N} be an enumeration of the 
rational numbers in [0,1]. Let S = the irrational numbers in [0,1] and for each s 
in S let {7,: n © N,} be a subsequence of {7,} such that s = lim{r,: n © N.}. 
Show: (a) if s, t€ S and s#t, NON, is finite; (b) if for each s in S, 
cl N, = the closure of N, in BN and A, = (clN,)\N, then {A,: s © S} are 
pairwise disjoint subsets of BN \N that are both open and closed. 


7. Show that if X is totally disconnected, then so is BX. 


8. Show that if r © BX and there is a sequence { x,,} in X such that x, > 7 in BX, 
then tr € X. 


9. Let X be the space of all ordinals less than the first uncountable ordinal and give 
X the order topology. Show that BX = the one point compactification of YX. 
(You can find the pertinent definitions in Kelley [1955].) 


§7. The Krein—Milman Theorem 


7.1. Definition. If K is a convex subset of a vector space 2, then a point a 
in K is an extreme point of K if there is no proper open line segment that 
contains a and lies entirely in K. Let ext K be the set of extreme points 
of K. 


Recall that an open line segment is a set of the form (x,, x.) = {tx, + 
(1 — t)x,: 0 <t <1}, and to say that this line segment is proper is to say 
that x, # x5. 


7.2. Examples. 


(a) If = R? and K = {(x, y) € R?: x? + y” < 1}, then ext K = {(x, y): 
x*+y?=1}. 

(b) If = R? and K = {(x, y) © R*: x < 0}, then ext K =D. 

(c) If =R* and K = {(x,y)eR*: x <0} U {(0,0)}, then ext K = 
{(0, 0)}. 

(d) If K = the closed region in R* bordered by a regular polygon, then 
ext K = the vertices of the polygon. 

(e) If 2 is any normed space and K = {x € @: ||x|| < 1}, thenext K C {x: 
||x|| = 1}, though for all we know it may be that ext K = 0. 

(f) If Z= L‘[0,1] and K = {f © L'[0,1]: |Ifl], < 1}, then ext K =O. This 
last statement requires a bit of proof. Let f © L*[0, 1] such that ||/||, = 1. 
Choose x in [0,1] such that /5|f(t)| dt = 4. Let h(t) = 2f(t) if t< x 
and Q otherwise; let g(t)=2/f(t) if t > x and O otherwise. Then 
All, = llgll, = 1 and f= 4(h +g). So ball L'[0,1] has no extreme 
points. 


The next proposition is left as an exercise. 
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7.3. Proposition. If K is a convex subset of a vector space & andaeé K, 
then the following statements are equivalent. 


(a) ac ext K. 

(b) If x,,x, €& and a= 5(x,+ x), then either x, € K or x, €K or 
xX, =X, = 4. 

(c) If x,,x, &€ %,0<t< 1, anda= tx, + (1 — ¢)x,, then either x, € K, 
x,€K, or x, =x, = 4. 

(d) If x,,...,x, © K anda © co{x,,...,x,}, then a = x, for some k. 

(e) K\ {a} is a convex set. 


7.4. The Krein—Milman Theorem. Jf K is a nonempty compact convex subset 
of a LCS &, then ext K #0 and K = co(ext K ). 


PRooF. (Léger [1968].) Note that (7.3c) says that a point a is an extreme 
point if and only if K \ {a} is a relatively open convex subset. We thus look 
for a maximal proper relatively open convex subset of K. Let Y be all the 
proper relatively open convex subsets of K. Since 2 is a LCS and K #0 
(and let’s assume that K is not a singleton), Y# 0. Let WY bea chain in Y 
and put U, = U{U: U © &%}. Clearly U is open, and since Y is a chain, 
U, is convex. If U) = K, then the compactness of K implies that there is a 
U in %, with U = K, a contradiction to the propriety of U. Thus Uy) © &. 
By Zorn Lemma, Y has a maximal element U. 

Ifx eK and0 <A <1, let T,,: K > K be defined by T, \(y) =Ay + 
(1 — A)x. Note that T, , is continuous and T, ,(27_14;);) = rn 10,T, .(¥,) 
whenever y,,..., y, © K, a,,...,a, = 0, and ay = L- (This means that 
T,, is an affine map of K into K.) If x @U and 0<A <1, then 
T, (U) C U. Thus UC JexU) and T, \(U) iS an open convex subset of 
K. If yeE(clU)\U, T, (vy) © [x, y) G U_ by Proposition IV.1.11. So 
clU ¢ T-\(U) and Renee the maximality of U implies T7,\(U) = K. That 
is, 


75 T, ,(K) CU ifx © Uand0 <A <1. 


Claim. If V is any open convex subset of K, then either VU U=U or 
VUU=K. 


In fact, (7.5) implies that V U U is convex so that the claim follows from 
the maximality of U. 

It now follows from the claim that K\ U is a singleton. In fact, if 
a,b & K\ U and a#b, let V,,V, be disjoint open convex subsets of K 
such that a € V, and b€ V,. By the clam V,U U=K since a € U. But 
b € V, U U, a contradiction. Thus K\ U = {a} and a € ext K by (7.3e). 
Hence ext K #0. 

Note that we have actually proved the following. 
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7.6 If V is an open convex subset of 2 and ext K C V, then K C V. 


In fact, if V is open and convex, VM K € Y and is contained in a 
maximal element U of Y. Since K\ U = {a} for some a in ext K, this is a 
contradiction. Thus (7.6) holds. 

Let E = co(ext K). If x* © %*,a ER, and EC {x ©: Re(x, x*) < 
a} = V, then K C V by (7.6). Thus the Hahn—Banach Theorem (IV.3.13) 
implies E = K. Us 


The Krein—Milman Theorem seems innocent enough, but it has 
widespread application. Two such applications will be seen in Sections 8 
and 10; another will occur later when C*-algebras are studied. Here a small 
application 1s given. 

If & is a Banach space, then ball %* is weak* compact by Alaoglu’s 
Theorem. By the Krein—Milman Theorem, ball #* has many extreme 
points. Keep this in mind. 


7.7. Example. cy is not the dual of a Banach space. That is, cg is not 
isometrically isomorphic to the dual of a Banach space. In light of the 
preceding comments, in order to prove this statement, it suffices to show 
that ballc, has few extreme points. In fact, ballc, has no extreme points. 
Let x € ballcy. It must be that 0 = lim x(n). Let N be such that |x(n)| < 4 
for n > N. Define y,, y, in cy by letting y,(n) = y,(n) = x(n) for n < N, 
and for n > N let y,(n) = x(n) +27" and y,(n) = x(n) — 27". It is easy 
to check that y, and y, € ballco, 3(y, + ¥) = x, and y, # x. 


In light of Example 7.2(f), L'[0,1] is not the dual of a Banach space. 
The next two results are often useful in applying the Krein—Milman 
Theorem. Indeed, the first is often taken as part of that result. 


7.8. Theorem. If 2 is a LCS, K is a compact convex subset of #, and 
FC K such that K = co(F), then ext K C cl F. 


PROOF. Clearly it suffices to assume that F is closed. Suppose that there is 
an extreme point x, of K such that x, ¢ F. Let p be a continuous 
seminorm on # such that FO {x © &: p(x — x.) <1} =O. Let UW = {x 
E #: p(x) < 3}. So(xp + UW) N(F + U) =O; hence x, € cl(F + U). 

Because F is compact, there are y,,..., y, in F such that FC U7_,(y, 
+ U)). Let K, =co(FN(y, + U))). Thus K, Cy, + clU) (Why?), and 
K, © K. Now the fact that K,,..., K,, are compact and convex implies that 
co(K, U--- UK,) = co(K, U --- UK,) (Exercise 8). Therefore 


K = co(F) = co(K, U--- UK,). 


Xg 1S an extreme point of K. Thus, x, = x, © K, for some k. But this 
implies that x, © K, Cy, + clU, C cl(F + U,), a contradiction. a 
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You might think that the set of extreme points of a compact convex 
subset would have to be closed. This is untrue even if the LCS is finite 
dimensional, as Figure V-1 illustrates. 


as 


Figure V-1 


7.9. Proposition. If K is a compact convex subset of a LCS 2,¥Y is a LCS, 
and T: K — Y is a continuous affine map, then T(K ) is a compact convex 
subset of Y and if y is an extreme point of T(K ), then there is an extreme 
point x of K such that T(x) = y. 


PROOF. Because T is affine, T7(K) is convex and it is compact by the 
continuity of 7. Let y be an extreme point of 7(K). It is easy to see that 
T~‘(y) is compact and convex. Let x be an extreme point of T '(y). It 
now follows that x © ext K (Exercise 9). ial 


Note that it is possible that there are extreme points x of K such that 
T(x) is not an extreme point of 7(K). For example, let T be the 
orthogonal projection of R? onto R? and let K = ballR’. 


EXERCISES 


1. If (X,Q, w) is a o-finite measure space and 1 < p < oo, then the set of extreme 
points of ball L?(n) is {f © L?(w): |Ifil, = 1. 


2. If (X,2,) is a o-finite measure space, the set of extreme points of ball L'( 1) is 
{ax,: E is an atom of p, a € F, and |a| = p(E)'}. 


3. If (X,Q,) is a o-finite measure space, the set of extreme points of ball L®(p) 


is (f © L°(p): [f(x)| = Lae. [n]}. 


4. If X is completely regular, the set of extreme points of ball C,(X) 1s { f © C,(X): 
[f(x)| = 1 for all x}. So ball Cy [0,1] has only two extreme points. 


5. Let X be a totally disconnected compact space. (That is, X 1s compact and if 
x © X and U is an open neighborhood of x, then there is a subset V of X that 
is both open and closed and such that x © VC U. The Cantor set is an 
example of such a space.) Show that ball C( X) is the norm closure of the convex 
hull of its extreme points. 


6. Show that ball /' is the norm closure of the convex hull of its extreme points. 


7. Show that if X is locally compact but not compact, then ball C,)(X) has no 
extreme points. 
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8. If # is a LCS and K,,...,K, are compact convex subsets of 2, then 
co(K, U --- UK,) = co(K, U ---: UK,). 


9. Let K be convex and let T: K — ¥Y be an affine map. If y is an extreme point 
of T(K) and x is an extreme point of T~'(y), then x is an extreme point of K. 


10. If # is a Hilbert space, show that T is an extreme point of ball @(.%@) if and 
only if either T or J7* is an isometry. 


§8. An Application: The Stone—Weierstrass Theorem 


If f: X > C is a function, then f denotes the function from X into C 
whose value at each x is the complex conjugate of f(x), f(x). 


8.1. The Stone—Weierstrass Theorem. [f X is compact and & is a closed 
subalgebra of C( X) such that: 


(a) 1 EX; 
(b) if x, y € X and x # y, then there is an f in & such that f(x) # f(y); 
(c) if f € L, then f € x; 

then Z= C(X). 


If C(X) is the algebra of continuous functions from X into R, then 
condition (c) is not needed. Also, an algebra in C( X) that has property (b) 
is said to separate the points of X (see Exercise 1). 

The proof of this result makes use of the Krein—Milman Theorem and is 
due to L. de Branges [1959]. 


PROOF OF THE STONE-—WEIERSTRASS THEOREM. To prove the theorem it 
suffices to show that »/ + = (0) (III.6.14). Suppose  ~ # (0). By Alaoglu’s 
Theorem, ball..7* is weak* compact. By the Krein-Milman Theorem, 
there is an extreme point p of ball .e~. Let K = the support of p. Since 
A+# (0), |u|) =1 and K#Q0. Fix x, in K. It will be shown that 
K = {Xo}. 

Let x € X, x # X,. By (b) there is an f, in © such that f,(x9) # f,(x) 
= B. By (a), the function B € #. Hence f, =f; — BE, f,(xX)) #0 = 
f,(x). By ©), fy = Al? =hh €#. Also, f(x) = 0 < fa(xo) and f, = 0. 
Put f= (lIGll + 74. Then f € #7, f(x) = 0, f(x.) > 0, andO<f<1 
on X. Moreover, because .¥ is an algebra, gf and g(1 — f) © # for every 
g in &. Because pe VH~*, 0= {gfdu = fg(1 — f) dp for every g in #. 
Therefore fu and (1 — f)peEexv~. 

(For any bounded Borel function 4 on X, hy denotes the measure whose 
value at a Borel set A is {,hdp. Note that ||hy|| = {|A| dlp.) 

Put a = ||ful| = {fdlu|. Since f(x.) > 0, there is an open neighborhood 
U of x, and an e > 0 such that f(y) > e for y in U. Thus, a = ffd|p| = 
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Syfd|p| = elu|\(U) > 0 since UN K # O. Similarly, since f(x.) < 1, a < 1. 
Therefore, 0 <a<1. Also, 1 —-a=1— ffdlp| = {A — f)dip| = la —- 


f )pl|. Since 
yg TN weg a ap 
req = re ary 


and pw is an extreme point of ball.7+1, w= fullful| | =a ‘fu. But the 
only way that the measures p and a ‘fu can be equal is if a ‘f= 1 ae. [py]. 
Since f is continuous, it must be that f= aon K. Since x, € K, f(x,) = a. 
But f(x,)) > f(x) = 0. Hence x € K. This establishes that K = {x,} and 
so # = 6, where |y| = 1. But hE" and 1€X%,s0o0= fldu=y, a 
contradiction. Therefore ./ * = (0) and = C(X). ies 


With an important theorem it is good to ask what happens if part of the 
hypothesis is deleted. If x» € X and w= {fe C(X): f(x,) = O}, then # 
is a closed subalgebra of C(X) that satisfies (b) and (c) but #%# CCX). 
This is the worse that can happen. 


8.2. Corollary. If X is compact and & is a closed subalgebra of C( X) that 
separates the points of X and is closed under complex conjugation, then either 
= C(X) or there is a point x, in X such that W= { f © C(X): f(x9) = O}. 


PROOF. Identify F and the one-dimensional subspace of C( X) consisting of 
the constant functions. Since .7 is closed, &%+ F is closed (III.4.3). It is 
easy to see that + F is an algebra and satisfies the hypothesis of the 
Stone—Weierstrass Theorem; hence 7+ F = C(X). Suppose ~# C(X). 
Then C(X)/# is one dimensional; thus .~* is one dimensional (Theorem 
2.2). Let pe V+, |u| = 1. If f € a, then fu ©. ~; hence there is an a 
in F such that fy = ap. This implies that each f in # is constant on the 
support of uw. But the functions in ./ separate the points of X. Hence the 
support of pw is a single point x) and so ~~ = {Bd,: BEF}. Thus 
A=*IF+={fEec(x): f(x) = 0}. fal 


There are many examples of subalgebras of C( X) that separate the points 
of X, contain the constants, but are not necessarily closed under complex 
conjugation. Indeed, a subalgebra of C( X) having these properties is called 
a uniform algebra or function algebra and their study forms a separate area 
of mathematics (Gamelin [1969]). One example (the most famous) 1s ob- 
tained by letting X be a subset of C and letting “= R(X) = the uniform 
closure of rational functions with poles off X. 

Let xo, x, © X,xX9 # X,, and let W= {fe CCX): f(xo) = f(x,)}. Then 
& is a uniformly closed subalgebra of C(X), contains the constant func- 
tions, and is closed under conjugation. In a certain sense this is the worst 
that can happen if the only hypothesis of the Stone—Weierstrass Theorem 


V.8. An Application: The Stone-Weierstrass Theorem 151 


that does not hold is that » fails to separate the points of X (see 
Exercise 4). 
If X is only assumed to be locally compact, then the story is similar. 


8.3. Corollary. If X is locally compact and & is a closed subalgebra of 
Cyo( X) such that 


(a) for each x in X there is an f in & such that f(x) # 0; 
(b) separates the points of X, 
(c) f © H whenever f © &; 


then LZ= C(X). 


PRooF. Let X,, = the one point compactification of X and identify C)(X) 
with {fe CCX,,): f(c) = 0}. So Ww becomes a subalgebra of CCX,,). 
Now apply Corollary 8.2. The details are left to the reader. 7 


What are the extreme points of the unit ball of M(X)? The characteriza- 
tion of these extreme points as well as the extreme points of the set P(X) of 
probability measures on X is given in the next theorem. [A probability 
measure iS a positive measure p such that p(X) = 1.] 


8.4. Theorem. Jf X is compact, then the set of extreme points of ball M(X) 
1S 


{a5,: |a| = landx € X}. 
The set of extreme points of P(X), the probability measures on X, is 
{6,:x EX}. 


PROOF. It is left as an exercise for the reader to show that if x € X, 6, is 
an extreme point of P(X) and aé, is an extreme point of ball MCX) 
(Exercise 3). 

It will now be shown that if » is an extreme point of P(X), then p is an 
extreme point of ball M(X). Thus the first part of the theorem implies the 
second. Suppose p, is an extreme point of P(X) and »p,,v, € ball MCX) 
such that w= }(v, +). Then 1 = [jp] < 3(\J4l| + [[rall) < 1; hence ||»,| 
+ ||v3|| = 2 and so ||»,|| = ||»2|| = 1. Also, 1 = w(X) = 3(7,(X) + »(X)). 
Now |v,(X)|, |v,(X)| < 1 and 1 is an extreme point of (a © F: ja| < 1}. 
Hence for k = 1,2, ||v,|| = »,(X) = 1. By Exercise III.7.2, v, € P(X) for 
k = 1,2. Since p © ext P(X), w=», = v5. So p is an extreme point of 
ball MCX). Thus it suffices to prove the first part of the theorem. 

Suppose that pw is an extreme point of ball M(X) and let K be the 
support of p. That is, 


K = X\ U{V: V is open and |p|(V) = 0}. 


Hence |u|(X \ K) = 0 and {fdyu = {,fdp for every f in CCX). It will be 
shown that K is a singleton set. 
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Fix x, in K and suppose there is a second point x in K, x # xy. Let U 
and V be open subsets of X such that x,» © U, x © V,andclUN clV = 0. 
By Urysohn’s Lemma there is an f in C(X) such thatO < f<1, f(y)=1 
for y in clU, and f(y) =0 for y in clV. Consider the measures fp and 
(1 — f)p. Put a = |[full = fi dlul| = ffdlu|. Then a = ffd|p| < ||u|| = 1 
and a = f{fd|u| = |w|(U) > 0 since U is open and UN K #0. Also, 1 — «@ 
=1— ffdlp| = fA — f)d|y| = |] — ful] and sol —ae fy — f)d|u! 
= |w|(V) > 0 since x € K. Hence 0 <a < 1. 

But fu/a and (1 — f)p/ — a) & ball MCX) and 

fu (2 —f)e 
n= a4] +( «| or | 
Since p is an extreme point of ball M(X) and a #0, p = fu/a. This can 
only happen if f=a<1 ae. [p]. But f=1 on U and |p\(U)> 0, a 
contradiction. Hence K = {x}. 

Since the only measures whose support can be the singleton set { x, } have 

the form ad, , a in F, the theorem 1s proved. ei 


EXERCISES 


1. Suppose that © is a subalgebra of C(X) that separates the points of X and 
1 € &. Show that if x,,...,x, are distinct points in X and a,,...,a, © F, there 
is an f in # such that f(x;)= a, forl <j <n. 


2. Give the details of the proof of Corollary 8.3. 


3. If X is compact, show that for each x in X, 6, is an extreme point of P(X) and 
ad, |a| = 1, is an extreme point of ball MCX). 


4. Let X be compact and let w be a closed subalgebra of C( X) such that 1 © W 
and » is closed under conjugation. Define an equivalence relation ~ on X by 
declaring x ~ y if and only if f(x) = f(y) for all f in . Let X/~ be the 
corresponding quotient space and let 7: X — X/~ be the natural map. Give 
X/~ the quotient topology. (a) Show that if f ©.%, then there is a unique 
function 7*(f) in C(X/~) such that 7*(f)oa =f. (b) Show that 7*: #%—- 
C(X/~ ) is an isometry. (c) Show that 7* is surjective. (d) Show that w= {fe 
C(X): f(x) = f(y) whenever x ~ y}. 


5. (This exercise requires Exercise IV.4.7.) Let X be completely regular and topolo- 
gize C(X) as in Example IV.1.5. If 7 is a closed subalgebra of C(X) such that 
1 €. YW, WM separates the points of X, and f ©. whenever f Ev, then Y= 
C(X). 


6. Let X,Y be compact spaces and show that if fe C(X X Y) and e > 0, then 
there are functions g,,...,g, in C(X) and h,,...,h, in C(Y) such that 
IPC) — La Be (X)A(Y)| < € for all (x, y) in X x Y. 


7. Let & be the uniformly closed subalgebra of C,(IR) generated by sinx and 
cos x. Show that #= {fe C,(R): f(t) = f(t + 27) for all ¢ in R}. 


n 


8. If K is a compact subset of C, f€ C(K), and e > 0, show that there is a 
polynomial p(z,Z) in z and z such that |f(z) — p(z,z)| < e for all z in K. 
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§9*. The Schauder Fixed-Point Theorem 


Fixed-point theorems hold a fascination for mathematicians and they are 
very applicable to a variety of mathematical and physical situations. In this 
section and the next two such theorems are presented. 

The results of this section are different from the rest of this book in an 
essential way. Although we will continue to look at convex subsets of 
Banach spaces, the functions will not be assumed to be linear or affine. This 
is a small part of nonlinear functional analysis. 

To begin with, recall the following classical result whose proof can be 
found in any algebraic topology book. (Also see Dugundji [1966].) 


9.1. Brouwer’s Fixed-Point Theorem. Jf 1 < d < co, B = the closed unit 
ball of R“, and f: B > B is a continuous map, then there is a point x in B 
such that f(x) = x. 


9.2. Corollary. If K is a nonempty compact convex subset of a finite-dimen- 
sional normed space & and f: K — K is a continuous function, then there is a 
point x in K such that f(x) = x. 


PROOF. Since % is isomorphic to either C? or R%, it is homeomorphic to 
either R°% or R“%. So it suffices to assume that = R47, 1 < d < oo. If 
K={x €R*%: ||x|| <r}, then the result is immediate from Brouwer’s 
Theorem (Exercise). If K is any compact convex subset of R%, let r > 0 
such that KC B= {x ER: ||x|| <r}. Let ¢: B—K be the function 
defined by (x) = the unique point y in K such that ||x — y|| = dist(x, K) 
(1.2.5). Then ¢ is continuous (Exercise) and (x) = x for each x in K. (In 
topological parlance, K is a retract of B.) Hence fo¢: B> KCB is 
continuous. By Brouwer’s Theorem, there is an x in B such that f(¢(x)) = 
x. Since fo @(B) C K, x © K. Hence $(x) = x and f(x) = x. aa 


Schauder’s Fixed-Point Theorem is a generalization of the preceding 
corollary to infinite-dimensional spaces. 


9.3. Definition. If 2 is a normed space and E C 2%, a function f: E > & 
is said to be compact if f is continuous and cl f(A) is compact whenever A 
is a bounded subset of E. 


If £ is itself a compact subset of 2, then every continuous function from 
E into @ is compact. 
The following lemma will be needed in the proof of Schauder’s Theorem. 


9.4. Lemma. /f K is a compact subset of the normed space 2, ¢ > 0, and A 
is a finite subset of K such that K C U{ B(a; ): a © A}, define $,: K > & 


by 
» {m,(x)a: a € A} 


a(x) = i {m,(x): ae A} 
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where m ,(x) = 0 if ||x — a|| > e andm,(x) =e — ||x — all if ||x — all <.e. 
Then , is a continuous function and 


lIbg(x) — xl] <e 
for all x in K. 


PROoF. Note that for each a in A, m,(x) => 0 and L{m,(x): a€ A} > 0 
for all x in K. So ¢, is well defined on K. The fact that @, is continuous 
follows from the fact that for each a in A, m,: K — [0,€] is continuous. 
(Verify!) 

If x € K, then 


aig eee oe et 
o4(x) Xx Tine): 7 A} , 


If m,(x) > 0, then {jx — aj] < e. Hence 


hla (xila = alls ae A} 
lolx) — SS cay 


This concludes the proof. Ed 


9.5. The Schauder Fixed-Point Theorem. Let E be a closed bounded convex 
subset of a normed space &. If f: E > is a compact map such that 
f{(E) C E, then there is an x in E such that f(x) = x. 


PRoor. Let K = cl f(E), so K © E. For each positive integer n let A,, be a 
finite subset of K such that K C U{ B(a;1/n): a © A,,}. For each n let 
o, = >, as in the preceding lemma. Now the definition of ¢, clearly implies 
that ,(K )C co(K) C E since E is convex; thus f, = ¢,° f maps E into 
E. Also, Lemma 9.4 implies 


9.6 Wf,(«) —f(x)\| <1/n for x in E. 


Let 2, be the linear span of the set A, and put E, = EM %,.So 2, isa 
finite-dimensional normed space, E, is a compact convex subset of %,,, and 
f,. E, > E,, (Why?) is continuous. By Corollary 9.2, there is a point x,, in 
E,, such that f,(x,,) = X,, 

Now { f(x,,)} is a sequence in the compact set K, so there is a point x, 
and a subsequence {f(%n)} such that f(x,,) > X,. Since fn (Xn,) = ids 


(9.6) implies 
IX,, — Xoll < Whe, (%2,) — f(x, + We.) > xl 
1 
s n, a f(x, ) — Xoll- 


Thus x, — X 9. Since f is continuous, f(x,) = lim f(x, ) = Xo. a 
J } 


There is a generalization of Schauder’s Theorem where 2% is only assumed 
to be a LCS. See Dunford and Schwartz [1958], p. 456. 
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EXERCISE 


1 Let E= {x e€/?(N): ||x|] <1} and for x in E define f(x) = ((1 — ||x]]), 
x(1), x(2), ...). Show that f(E) C E, f is continuous, and f has no fixed points. 


§10*. The Ryll—-Nardzewski Fixed-Point Theorem 


This section begins by proving a fixed-point theorem that in addition to 
being used to prove the result in the title of this section has some interest of 
its own. Recall that a map T defined from a convex set K into a vector 
space is said to be affine if T(La;x,) = La,T(x,;) when x, © K, a, > 0, and 
La, = 1. 


10.1. The Markov—Kakutani Fixed-Point Theorem. If K is a nonempty 
compact convex subset of a LCS 2 and F¥ is a family of continuous affine 
maps of K into itself that is abelian, then there is an x, in K such that 
T(X9) = Xo for all T in F. 


ProoF. If T € ¥ and n > 1, define T’”: K > K by 
1 n—l 
POSS ST 
Peg 


If S and T€¥ and n,m > 1, then it is easy to check that S°VT(™ = 
TOMS, Let H= {T(K): T EF, n>1}. Each set in ¥ is compact 
and convex. If 7),...,7,<&F and n,,...,n y = 1, then the commutativity 
of ¥# implies that Tend) + TK) CNP {T(K). This says that 
has the finite intersection property and hence there is an x, in f\{ B: 
Be HX}. It is claimed that x, is the desired common fixed point for the 
maps in F¥. 

If T€¥F and n > 1, then x, € T’(K ). Thus there is an x in K such 
that 


] 
i= Tie) = ale + T(x) +--+) +7" '(x)]. 
Using this equation for xq, it follows that 


T(x) — xo = a (T(x) + + +(x) 
~=[x4 T(x) + +7) 


[T"(x) — x] 


Te ae 


e—[K- K]. 
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Now K is compact and so K — K is also. If U is an open neighborhood of 0 
in &, there is an integer n >1 such that n ‘[K — K]C U. Therefore 
T(X.) — Xo © U for every open neighborhood U of 0. This implies that 
T(X9) — Xo = 0. a 


If p is a seminorm on & and A C &@, define the p-diameter of A to be 
the number 


p-diam A = sup{ p(x — y): x,y € A}. 


10.2. Lemma. If 2 is a LCS, K is a nonempty separable weakly compact 
convex subset of &, and p is a continuous seminorm on &, then for every 
e > 0 there is a closed convex subset C of K such that: 


(a) C# K; 
(b) p-diam(K \ C) < e. 


Proor. Let S = {x € #: p(x) < &/4} and let D = the weak closure of the 
set of extreme points of K. Note that DC K. By hypothesis there is a 
countable subset A of K such that DC K C U{a+ S: a € A}. Now each 
a + S is weakly closed. (Why?) Since D is weakly compact, there is an a in 
A such that (a + S$) D has interior in the relative weak topology of D 
(Exercise 2). Thus, there is a weakly open subset W of 2 such that 


10.3 (a+ S)ADDWNAD#O. 


Let K, = co(D\ W) and K, =co(DM W). Because K, and K, are 
compact and convex and K, U K, contains the extreme points of K, the 
Krein—Milman Theorem and Exercise 7.8 imply K = co(K, U K,). 


10.4. Claim. K, # K. 


In fact, if K, = K, then K = co(D\ W) so that ext K C D\ W (Theo- 
rem 7.8). This implies that D C D\ W, or that W 1 D = (0a contradiction 
to (10.3). 

Now (10.3) implies that K, C a + S; so the definition of S implies that 
p-diam K, < €/2. Let 0 <r <1 and define f: K, X K, x[r,l]} > K by 
f(x), X5,t) = tx, + I — t)x,. So f, is continuous and C, = f,(K, X K, Xx 
[r,1]) is weakly compact and convex. (Verify!) 


10.5. Claim. C.# K for0 <r<l. 


In fact, if C. = K and e © ext K, then e = tx, + (1 — t)x, for some 1, 
r<t<1, x, in K,. Because e is an extreme point and t # 0, e = x,. Thus 
ext K C K, and K = K,, contradicting (10.4). 

Let y © K\ C.. The definition of C, and the fact that K = co(K, U K,) 
imply y = tx, + (1 — t)x, with x, in K; and 0 <?¢ <r. Hence p(y — x>) 
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= p(t(x, — X5)) = tp(x, — x,) < rd, where d= p-diam K. Therefore, if 
y=txi + —-t’)x,EK\C, then p(y — y’) < p(y — x2) + p(x, - 
x5) + p(x4 — y’) < 2rd + p-diam K, < 2rd + €/2. Choosing r = «/4d 
and putting C = C,, we have proved the lemma. Ie 


10.6. Definition. Let 2 be a LCS and let Q be a nonempty subset of %. If 
SF is a family of maps (not necessarily linear) of Q into Q, then is said to 
be a noncontracting family of maps if for two distinct points x and y in Q, 


0¢cl{T(x)-Tly): TES}. 


The next lemma has a straightforward proof whose discovery is left to the 
reader. 


10.7. Lemma. Jf 2 isa LCS, Q C &, and ¥ is a family of maps of QO into 
Q, then S is a noncontracting family if and only if for every pair of distinct 
points x and y in Q there is a continuous seminorm p such that 


inf{ p(T(x) — T(y)): TE SF} > 0. 


10.8. The Ryll-Nardzewski Fixed-Point Theorem. If % is a LCS, Q is a 
weakly compact convex subset of &, and F£ is a noncontracting semigroup of 
weakly continuous affine maps of Q into Q, then there is a point x, in QO such 
that T(X9) = Xq for every T in F. 


PrRooF. The proof begins by showing that every finite subset of Y has a 
common fixed point. 


10.9. Claim. If {7),...,7,,} GC A, then there is an x, in Q such that 
T,X) =X, forl<k <n. 


Put 7) = (7, + --- +T7,,)/n; so Ty: Q > Q and Ty is weakly continuous 
and affine. By (10.1), there is an x, in Q such that 7)(x,)) = xj. It will be 
shown that 7,(x9) = xX) for 1 < k <n. In fact, if T,(x,)) # x, for some k, 
then by renumbering the 7,, it can be assumed that there is an integer m 
such that T,(x 9) # x9 for 1 < k < mand T,(x,) = X) for m<k <n. Let 
Ty = (7, + +--+ +T,,)/m. Then 


Xo = Ty (Xo) 


= ~[T,(x0) af ee + T,,(x9)] +(4 — aoe 


n 
Hence 
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Thus it may be assumed that 7,(x,) # X, for all k, but 7o(x9) = x9. Make 
this assumption. 

By Lemma 10.7, there is an ¢ > 0 and there is a continuous seminorm p 
on & such that for every Tin Y andl <k <a, 


10.10 P(T(T,(xo)) - T(x,)) >. 


a SF, = the semigroup generated by {7),75,...,7,,}. So “% CS and 
={T, ---T,: m2=1,1</,<n}. Thus % isa countable subsemi- 
ae of Y. Put K= co{ T (Xp): Té~fF,}. Therefore K is a weakly 
compact convex subset of Q and K is separable. By Lemma 10.2, there is a 
closed convex subset C of K such that C # K and p-diam(K\ C) < «. 
Since C # K, there is an S in Y, such that S(x,) © K \ C. Hence 


S(X9) = STy(xo) = ~ (ST, (xo) + +++ +S8T,(%)| € K\C. 


Since C is convex, there must bea k,1 < k < n, such that ST,(x)) © K\ C. 
But this implies that p(S(T,(x,)) — S(x9)) < p-diam(K \ C) < e, con- 
tradicting (10.10). This establishes Claim 10.9. 

Let Y= all finite nonempty subsets of Y. If F&F, let 0, = {x © Q: 
T(x) = x for all T in F}. By Claim 10.9, Q, # O for every F in #. Also, 
since each J in Y is weakly continuous and affine, Q, 1s convex and 
weakly compact. It is easy to see that {Q,: F © F } has the finite intersec- 
tion property. Therefore, there is an x» nf\{Q;: F © F }. The point x, is 
the desired common fixed point for /. 2 


The original reference for this theorem is Ryll-Nardzewski [1967]; the 
treatment here is from Namioka and Asplund [1967]. An application of this 
theorem is given in the next section. 


EXERCISES 
1. Was local convexity used in the proof of Theorem 10.1? 


2. Show that if X is locally compact and X = U%_, F,, where each F, is closed in 
X, then there is an integer n such that int F, # O. (Hint: Look at the proof of the 
Baire Category Theorem.) 


§11*. An Application: Haar Measure on a 
Compact Group 


In this section the operation on all semigroups and groups is denoted by 
multiplication. 
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11.1. Definition. A ‘topological semigroup is a semigroup G that also is a 
topological space and such that the map G X G > G defined by (x, y) > xy 
is continuous. A topological group is a topological semigroup that is also a 
group such that the map G > G defined by x + x7! is continuous. 


So a topological group is both a group and a topological space with a 
property that ties these two structures together. 


11.2. Examples 


(a) N and R , , are topological semigroups under addition. 

(b) Z, R, and C are topological groups under addition. 

(c) OD is a topological group under multiplication. 

(d) If X is a topological space and G = {fe C(X): f(X) C dD}, define 
( fg)(x) = f(x)g(x) for f, g inG and x in X. Then G is a group. If G is 
given the topology of uniform convergence on X, G is a topological 
group. 

(e) For n > 1, let M,(C) = the n X n matrices with entries in C; O(n) = 
{A © M,(C): A is invertible and A~' = A*}; SO(n) = {A © O(n): 
det A = 1}. If M,(C) is given the usual topology, O(n) and SO(n) are 
compact topological groups under multiplication. 


There are many more examples and the subject is a self-sustaining area of 
research. Some good references are Hewitt and Ross [1963] and Rudin 
[1962]. 


11.3. Definition. If S is a semigroup and f: S — F, then for every x in S 
define f,: S > F and ,f: S > F by f(s) = f(sx) and , f(s) = f(xs) for all 
s in S. If S is also a group, let f*(s) = f(s~*) for all s in S. 


11.4. Theorem. Jf G is a compact topological group, then there is a unique 
positive regular Borel measure m on G such that 


(a) m(G) = 1; 

(b) if U is a nonempty open subset of G, then m(U) > 0; 

(c) if A is any Borel subset of G and x € G, then m(A) = m(Ax) = m(xA) 
= m(A~'), where Ax = {ax: aG A}, xA = {xa: ae A}, and A! = 
{a-*: aE A}. 


The measure m is called the Haar measure for G. If G is locally compact, 
then it is also true that there is a positive Borel measure m on G satisfying 
(b) and such that m(Ax) = m(A) for all x in G and every Borel subset A of 
G. It is not necessarily true that m(4) = m(xA), let alone that m(A) = 
m(A~') (see Exercise 4). The measure m is necessarily unbounded if G is 
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not compact, so that (a) is not possible. Uniqueness, however, is still true in 
a modified form: if m,,m, are two such measures, then m, = am, for 
some a > 0. 

By using the Riesz Representation Theorem for representing bounded 
linear functionals on C(G), Theorem 11.4 is equivalent to the following. 


11.5. Theorem. Jf G is a compact topological group, then there exists a 
unique positive linear functional I: C(G) — F such that 


(a) (0) = 1; 
(b) if f= C(G), f = 0, and f ¥ 0, then I(f) > 0: 
(c) iff € C(G) and x € G, then I(f) = I(f,) = IC. f) = I(f*). 


Before proving Theorem 11.5, we need the following lemma. For a 
compact topological group G, if x © G, define L,: M(G) > M(G) and R,: 
M(G) > M(G) by 


Cf. E(u) = ffdp, 


Cf, Ri(w)) = [fan 


for f in C(G) and p in M(G). Define So: M(G) > M(G) by 


(f,So(#)) = [f* dp 


for f in C(G) and p in M(G). It is easy to check that L,, R,, and S, are 
linear isometries of M(G) onto M(G) (Exercise 5). 


11.6. Lemma. Jf G is a compact topological group, 4 © M(G), and op: 
G X G > (M(G), wk*) is defined by p(x, y) = L,R,("), then p is continu- 
ous. Similarly, if py: GX G—(M(G),wk*) is defined by p)(x. y)= 
SoL,R,(u), then p is continuous. 


Proor. Let f € C(G) and let e > 0. Then (Exercise 10) there is a neighbor- 
hood U of e (the identity of G) such that |f(x) — f(y)| < e whenever 
xy-'€U or x 'y € U. Suppose {(x,, y,)} is a net in G X G such that 
(x;, ¥,) > (x, y). Let ig be such that for i > ip, x,x ' € Uand y,'y © U. 
If x €G, then [f(x,zy) — f(azy)| < If(x,zy,) — fzy,)| + [fezy,) - 
f(xzy)|. But if i>i, and z&G, (x,zy,)(xzy,)' = x,x~' © U and 
(xzy,)~(xzy) = y, ty © U. Hence |f(x,zy,) — f(xzy)| < 2e for i > ig and 
for all z in G. Thus lim, {f(x,zy,) du(z) = {f(xzy) du(z). Since f was 
arbitrary, this implies that p(x,, y,) > p(x, y)wk* in M(G). The proof for 
Po is similar. ies 
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PROOF OF THEOREM 11.5. If e = the identity of G, then 


L,R, =R,L, 
LL, = Ly, 
11.7 RR, = R,, 


Sj = L, = R, = the identity on M(G) 
SoL,R, = L,-1R,-1Sp 


for x, y in G. Hence 
(SLR, )(SoL,R,) i: (LR ,-1Sy)(SoL,R,) 
= L1R,1L,R, 
= LaLb,R,-aR, 
=f L,y-1R,-1,. 
Hence if S, = the identity on M(G), 
S={SL.R,:i=0,1; x,y eG} 


is a group of surjective linear isometries of M(G). Let Q = the probability 
measures on G; that is, Q = {uw © M(G): w» > 0 and p(G) = 1}. So 2 isa 
convex subset of M(G) that is wk* compact. Furthermore, 7(Q) C Q for 
every T in . 


11.8. Claim. If » © M(G) and p # 0, then 0 € the weak* closure of { T( 1): 
Tre? >}. 


In fact, Lemma 11.6 implies that {7(w): T © £ } is weak* closed. Since 
each J in & is an isometry, T(u) # 0 for every T in Y. 

By Claim 11.8, Y is a noncontracting family of affine maps of Q into 
itself. Moreover, if T= SoL,R, and {p,} is a net in Q such that yp, > 
w(wk*), then for every f in C(G), (f,T(H,)) = [f(xs !y) dus) > 
If(xs”“y) dp = (f,T(p)). So each T in & is wk* continuous on Q. By the 
Ryll—Nardzewski Fixed-Point Theorem, there is a measure m in Q such 
that 7(m) = m for all Tin Y. 

By definition, (a) holds. Also, for any x in G and f in C(G), 
[f(xs) dm(s) = (f, L,(m)) = {fdm. By similar equations, (c) holds. Now 
suppose fe C(G), f=>0, and f #0. Then there is an e > 0 such that 
U = {x ©G: f(x) > €} is nonempty. Since U is open, G = U{Ux: x € G}, 
and G is compact, there are x,,x,,...,x, mm G such that G C U?_,Ux,. 
(Why is Ux open?) Define g,(x) = f(xx,') and put g = X7_,g,. Then 
g = C(G) and {gdm = Li_,/g,dm = nffdm by (c). But for any x in G 
there is an x, such that xx,’ © U; hence g(x) > g,(x) = f(xx,')> «. 
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Thus 


[fam == [gdm > ¢/n > 0. 


This proves (b). 

To prove uniqueness, let be a probability measure on G_ having 
properties (a), (b), and (c). If fe C(G) and x € G, then ffdu = {.fdu. 
Hence 


[rau =f] [f(0) du(»)| amc) 
= f| fro) au(y)) ant) 
= {| f4(e) am(x)} duly) 
= f| fel) amc] dul) 


= | fam. 


Hence p = m. e 


For further information on Haar measure see Nachbin [1965]. 

What happens if G is only a semigroup? In this case L, and R, may not 
be isometries, so {L,R,: x, y © G} may not be noncontractive. However, 
there are measures for some semigroups that are invariant (see Exercise 7). 
For further reading see Greenleaf [1969]. 


EXERCISES 


1. Let G be a group and a topological space. Show that G is a topological group if 
and only if the map of G X G > G defined by (x, y) ~ x 'y is continuous. 


2. Verify the statements in (11.2). 
3. Show that Theorems (11.4) and (11.5) are equivalent. 


4. Let G be a locally compact group. If m is a regular Borel measure on G, show 
that any two of the following properties imply the third: (a) m(Ax) = m(A) for 
every Borel set A and every x in G; (b) m(x4) = m(A) for every Borel set A 
and every x in G; (c) m(A) = m(4~') for every Borel set A. 


5. Show that the maps S), L,, R, are linear isometries of M(G) onto M(G). 
6. Prove (11.7). 


7. Let S be an abelian semigroup and show that there is a positive linear 
functional L: /~(S) — F such that (a) L(1) = 1, (b) L(f.) = L(f) for every f 
in /*(S). 
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8. Show that if S and L are as in Exercise 7, and S is infinite, then L(f) = 0 
whenever {s © S: f(s) # 0} is finite. 


9. If S = N, what does Exercise 7 say about Banach limits? 


10. If G is a compact group, f: G — F is a continuous function, and e > 0, show 
that there is a neighborhood U of the identity in G such that |[f(x) — f(y)| <« 
whenever xy! € U. (Note that this says that every continuous function on a 
compact group is uniformly continuous.) 


11. If G is a locally compact group and f € C,(G), let O( f) = the closure of { f,: 
x € G} in C,(G). Let AP(G) = {f © C,(G): O(f) is compact}. Functions in 
AP(G) are called almost periodic. (a) Show that every periodic function in 
C,(IR) belongs to AP(R). (b) If G is compact, show that AP(G) = C(G). (c) 
Show that if f © C,(R), then f € AP(R) if and only if for every e > O there is a 
positive number T such that in every interval of length 7 there is a number p 
such that |f(x) — f(x + p)| < e for all x in R. (d) If G is not compact, then 
the only function in AP(G) having compact support is the zero function. (e) 
Prove that there is a bounded linear functional L: AP(G)— F such that 
LQ)=1, L(f)=O if f=, and L(f,) = L(f) for all f in AP(G) and x 
in G. 


§12*. The Krein—Smulian Theorem 


Let A be a convex subset of a Banach space 2. If A is weakly closed, then 
for everyr>0, AN {x © @: ||x|| < 7} is weakly closed; this is clear since 
each of the sets in the intersection is weakly closed. But the converse of this 
is also true: if A is convex and AM { X € &: ||x|| < r} is weakly closed for 
every r > 0, then A is weakly closed. In fact, because A is convex it suffices 
to prove that A is norm closed (Corollary 1.5). If {x,} ¢ A and ||x, — xo 
-> Q, then there is a constant r such that ||x,|| <r for all ”. By hypothesis, 
AM {x ©€4: ||x|| <r} is weakly closed and hence norm closed. Thus 
X6-< A; 

Now let A be a convex subset of 2*, % a Banach space. If AN {x* © 
X*: ||x*|| <r} is weak-star closed for every r > 0, is A weak-star closed? 
If Z is reflexive, then this is the same question that was asked and answered 
affirmatively in the preceding paragraph. If 2 is not reflexive, then the 
preceding argument fails since there are norm-closed convex subsets of % * 
that are not weak-star closed. (Example: let x** € @**\ % and consider 
A = kerx**.) Nevertheless, even though the argument fails, the statement is 
true. 


12.1. The Krein—Smulian Theorem. Jf 2 is a Banach space and A is a 
convex subset of &* such that AN {x* © &*: ||x*|| <r} is weak-star 
closed for every r > 0, then A is weak-star closed. 


To prove this theorem, two lemmas are needed. 
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12.2. Lemma. If 2 is a Banach space, r > 0, and F¥, is the collection of all 
finite subsets of {x © #: ||x|| <r-'}, then 


Fo FEF} = (x* e¥*: |[x*|] <r}. 


Proor. Let E=f\{ F°: F © ¥}; it is easy to see that r(ball ¥*) C E. If 
x* € r(ball ¥*), then there is an x in ball 2% such that |(x,x*)| > r. 
Hence |(r-'x,x*)|>landx*¢€E. & 


12.3. Lemma. If A and & satisfy the hypothesis of the Krein—Smulian 
Theorem and, moreover, A ( ball ¥* =O, then there is an x in & such that 


Re(x,x*) > 1 
for all x* in A. 


Proor. The proof begins by showing that there are finite subsets Fo, F;, ... 
of & such that 


(i) nF, C ball Z; 


a (ii) n(ball @*) Nj Fe AA =O. 

To establish (12.4) use induction as follows. Let Fy = (0). Suppose that 
Fo,.-.,F,_, have been chosen satisfying (12.4) and set Q=[(n + 
1)ball &*] ANZ FY 21 A. Note that Q is wk* compact. So if Q@ F° #0 
for every finite subset F of n~'ball 2, then 0 # QO((F®: F is a finite 
subset of n~ ‘(ball %)} = QM [n(ball X*)] by the preceding lemma. This 
contradicts (12.4ii). Therefore there is a finite subset F, of n~ ‘(ball 2) such 
that Q 1 F° = (1. This proves (12.4). 

If {F,}°_, satisfies (12.4), then A %_,F.° =. Arrange the elements 
of U%_,F, in a sequence and denote this sequence by {x,,}. Note that 
lim||x,|| = 0. Thus if x* © #*, {(x,,x*)} € co. Define 7: 4* > cy by 
T(x*) = {(x,, x*)}. It is easy to see that T is linear (and bounded, though 
this fact is unnecessary). Hence 7T(A) is a convex subset of cy. Also, from 
the construction of {x,}=U%_,F, for each x* in A, ||7(x*)|| = 
sup, |(x,,x*)| > 1. That is, T(A) | ball cg = 0. Thus Theorem III.3.7 ap- 
plies to the sets T( A) and int[ball c,] and there is an f in /' = c% and an a 
in R such that Re(¢, f) < a < Re(T(x*), f) for every ¢ in int[ball c,] and 
x* in A. That is, 


CO oe) 
12.5 Re > o(n)f(n) <a < Re > (x,, x*)f(n) 

n=1 n=1 
for every ¢ in C, with ||¢|| < 1 and for every x* in A. Replacing f by f/||/|| 
and a by a/||/\|, it is clear that it may be assumed that (12.5) holds with 
I/l| = 1. If @ € co, |]o|] < 1, let p © F such that |u| = 1 and (pe, f) = 
I<, f)|. Applying this to (12.5) and taking the supremum over all ¢ in 
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int[ball cy] gives that 1 < ReX%_,(x,, x*)f(n) for all x* in A. But fe /’ 


n=l] 


sox = Lj f(n)x, © X and 1 < Rex, x*) for all x* in A. ey 


n=] 


Where was the completeness of 2 used in the preceding proof? 


PROOF OF THE KREIN-SMULIAN THEOREM. Let xf © %*\ A; it will be 
shown that xj € wk* — cl A. It is easy to see that A is norm closed. So 
there is an r > QO such that {x* © @™*: ||x* — xf|| <r} OA =O. But this 
implies that ball 2* 4 [r~ ‘(A — x%)] =O. With this it is easy to see that 
r_'(A — xX) satisfies the hypothesis of the preceding lemma. Therefore 
there is an x in & such that Re(x, x*) > 1 for all x* in r-'(A — x%). In 
particular, 0 € wk* — cl[r-'(A — x*)] and hence x*¥ € wk* —clA. & 


12.6. Corollary. If 2 is a Banach space and Y is a linear manifold in % *, 
then Y is weak-star closed if and only if ¥Mball 2 * is weak-star closed. 


12.7. Corollary. If 2 is a separable Banach space and A is a convex subset 
of &* that is weak-star sequentially closed, then A is weak-star closed. 


PROOF. Because % is separable, r(ball Z *) is weak-star metrizable for every 
r>0 (Theorem 5.1). So if A is weak-star sequentially closed, AN 
[r(ball 2 *)] is weak-star closed for every r > 0. Hence the Krein—Smulian 
Theorem applies. a 


This last corollary is one of the most useful forms of the Krein-Smulian 
Theorem. To show that a convex subset A of #* is weak-star closed it is 
not necessary to show that every weak-star convergent net from A has its 
limit in A; it suffices to prove this for sequences. 


12.8. Corollary. If 2 is a separable Banach space and F: #* > F isa 
linear functional, then F is weak-star continuous if and only if F is weak-star 
sequentially continuous. 


PROOF. By Theorem IV.3.1, F is wk* continuous if and only if ker F is wk* 
closed. This corollary is, therefore, a direct consequence of the preceding 
one. ua 


There is a misinterpretation of the Krein—Smulian Theorem that the 
reader should be warned about. If A is a weak-star closed convex subset of 
ball @*, let M@=Uf{rA: r> 0}. It is easy to see that @ is a linear 
manifold, but it does not follow that .W 1s weak-star closed. What is true is 
the following. 


12.9. Theorem. Let 2 be a Banach space and let A be a weak-star closed 
subset of &*. If Y= the linear span of A, then Y is norm closed in #* if 
and only if Y is weak-star closed. 
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The proof will not be presented here. The interested reader can consult 
Dunford and Schwartz [1958], p. 429. 

There is a method for finding the weak-star closure of a linear manifold 
that is quite useful despite its seemingly bizarre appearance. Let % be a 
Banach space and let -@ be a linear manifold in %*. For each ordinal 
number « define a linear manifold 7, as follows. Let 47, =.Z@. Suppose a 
is an ordinal number and .@, has been defined for each ordinal 8 < a. If a 
has an immediate predecessor, a — 1, let 4, be the weak-star sequential 
closure of 4, _,. If a is a limit ordinal and has no immediate predecessor, 
let 4, = U{.@,: B < a}. In each case ./, is a linear manifold in 2* and 
Mp cM, if B<a. 


12.10. Theorem. Jf Z is a separable Banach space, 4 is a linear manifold 
in #*, and M , is defined as above for every ordinal number a, then Mg is 
the weak-star closure of M, where Q is the first uncountable ordinal. More- 
over, there is an ordinal number a < Q such that H, = M 9. 


PROOF. By Corollary 12.7 it suffices to show that .Z, is weak-star sequen- 
tially closed. Let {x*} be a sequence in @, such that x* > x* (wk”*). 
Since #4) =U{.4,: a < 2}, for each n there is an a, < 92 such that 
xx €M,. But a = sup,a, < 9%. Hence x* © .@, for all n; thus x © Z,,, 
CM, and “4g is weak-star closed. 

To see that W,=.@, for some a < (2, let {x*} be a countable wk* 
dense subset of ball .W,. For each n there is an a, such that x* €.Z@,. Put 
a = sup,a,. So {x*} Cc ball .Z@,. Put ball .@, 1s a compact metric space in 
the weak-star topology, so {x*} is wk* sequentially dense in ball Zo. 
Therefore ball. 4, C ball.7%,,, and W.=.4, ,4. = 


When is .4 weak-star sequentially dense in 2% *? The following result of 
Banach answers this question. 


12.11. Theorem. Jf 2% is a separable Banach space and 4 is a linear 
manifold in &*, then the following statements are equivalent. 


(a) 4 is weak-star sequentially dense in &*. 
(b) There is a positive constant c such that for every x in 2, 


|x| < sup{|(x, x*)|: KPC ACI" || eh. 


(c) There is a positive constant c such that if x* © ball 2*, there is a 
sequence {xf} in M, \|xf|| < c, such that xf > x* (wk*). 


PROOF. It is clear that (c) implies (a). The proof will consist in showing that 
(a) implies (c) and that (b) and (c) are equivalent. 

(a) = (c): For each positive integer n, let A, = the wk* closure of 
n(ball .@). If x* © &*, let {xf} be a sequence in -4@ such that xf —> x* 
(wk*). By the PUB, there is an n such that ||x*|| <7 for all &. Hence 
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x* €A,. That is, U%_,A, = #*. Clearly each A, is norm closed, so the 
Baire Category Theorem implies that there is an A, that has interior in the 
norm topology. Thus there is an x* in A, and an r> 0 such that 
A, > {x* © #*: ||x* — xf|| <r}. Let (xf} C n(ball.Z) such that xf > 
x* (wk*). If x* © ball #*, then x* + rx* © A,; hence there is a sequence 
{ y*} in n(ball.@) such that y*# > xX + rx* (wk*). Thus r-'(y* — x) 
—> x* (wk*) and r7‘(y* — x*) € c(ball W%), where c = 2n/r is indepen- 
dent of x*. 

(c) => (b): If x € &, then Alaoglu’s Theorem implies there is an x* in 
ball ¥* such that (x,x*) = ||x||. By (c), there is a sequence {xf} in 
c(ball.#) such that x* > x* (wk*). Thus (x, x) — ||x|| and (b) holds. 

(b) = (c): According to (b), ball 2D °[c(ball .Z)]. Hence ball 2* = 
(ball Z)° C€ °[c(ball @)] °. By (1.8), °[c(ball #)]° = the weak-star closure 
of c(ball.#). But bounded subsets of &* are weak-star metrizable (5.1) 
and hence (c) follows. a 


EXERCISES 


1. Suppose % is a normed space and that the only hyperplanes 4 in %* such that 
M ball #* is weak-star closed are those that are weak-star closed. Prove that 
# is a Banach space. 


2. (von Neumann) Let A be the subset of /? consisting of all vectors { x,,,: 
1 <m<n< oo} where x,,,(m) = 1, x,,,(") = m, and x,,,(k) = Oif k # m,n. 
Show that 0 € wk — cl A but no sequence in A converges weakly to 0. 


3. Where were the hypotheses of the separability and completeness of 2% used in the 
proof of Theorem 12.11? 


4. Let % be a separable Banach space. If 4 is a linear manifold in 2* give 
necessary and sufficient conditions that every functional in wk* — cl.@ be the 
wk* limit of a sequence from 7%. 


5. Let & be a normed space and let 7 be a locally convex topology on % such that 
ball is #compact. Show that there is a Banach space Y such that % is 
isometrically isomorphic to ¥Y*. (Hint: Let Y= {x* E2*: x*|ballZ is Fcon- 
tinuous }.) 


§13*. Weak Compactness 


In this section, two results are stated without proof. These results are among 
the deepest in the study of weak topologies. 


13.1. The Eberlein—Smulian Theorem. Jf 2 is a Banach space and A C &, 
then the following statements are equivalent. 


(a) Each sequence of elements of A has a subsequence that is weakly conver- 
gent. 
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(b) Each sequence of elements of A has a weak cluster point. 
(c) The weak closure of A is weakly compact. 


The proof can be found in Dunford and Schwartz [1958], p. 430. The 
serious student should examine Chapter V of Dunford and Schwartz [1958] 
for several results not presented here as well as for some of the history 
behind the material of this chapter. 

The following is an easy consequence of the Eberlein—Smulian Theorem. 


13.2. Corollary. Jf 2 is a Banach space and A C4, then A is weakly 
compact if and only if A (4 is weakly compact for every separable subspace 
M of &. 


If & is a Banach space and A is a weakly compact subset of %, then for 
each x* in &* there is an x) in A such that |(x,, x*)| = sup{|(x, x*)|: 
x € A}. Itis a rather deep fact due to R. C. James [1964a] that the converse 
is true. 


13.3. James’s Theorem. If 2 is a Banach space and A is a closed convex 
subset of & such that for each x* in 2 * there is an x, in A with 


xo. x*)| = sup{I(x,x*)|: x © A}, 


then A is weakly compact. 


Another reference for a proof of this theorem as well as a number of 
other equivalent formulations of weak compactness and reflexivity is James 
[1964b]. Also, if 2 is only assumed to be a normed space in Theorem 13.2, 
the conclusion is false (see James [1971]). 

The next result, presented with proof, is also called the Krein—Smulian 
Theorem and must not be confused with the theorem of the preceding 
section. 


13.4. Krein—Smulian Theorem. Jf 2 is a Banach space and K is a weakly 
compact subset of #, then co(K ) is weakly compact. 


PROOF. Case 1: & is separable. Endow K with the relative weak topology; 
so M(K)= C(K)*. If » © M(K), define F: 2* > F by 


F(x*) = J&x x*\ du(x). 


It is easy to see that F, is a bounded linear functional on %* and 
Fill < llellsup{|[xl[: x © K }. 


13.5. Claim. Ti: 4 * — F is weak-star continuous. 
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By (12.8) it suffices to show that F, is weak* sequentially continuous. Let 
{x*} be a sequence in 2* such that x* > x* (wk*). By the PUB, 
M = sup,||x*|| < oo. Also, (x, x*) > (x,x*) for every x in K. By the 
Lebesgue Dominated Convergence Theorem, Fi(x*) = [(x, x) du(x) > 
F(X *). So (13.5) is established. 

By (1.3), F, © 2%. That is, there is an x, in 2 such that F(x*) = 


pe 


(x,,x*). Define T: M(K) > % by T(p) = x,,. 
13.6. Claim. T: (M(K ), wk*) > (2%, wk) is continuous. 


In fact, this is clear. If 4; > 0 weak* in M(K), then for each x* in 2*, 
x*|K © C(K). Hence (7T(p,),x*) = [(x,x*) du(x) > 0. 

Let A= the probability measures on K. By Alaoglu’s Theorem #¥ is 
weak* compact. Thus 7(#) is weakly compact and convex. However, if 
x © K, (T(6,), x*) = (x,x*); that is, T(6,) = x. So T(F) D K. Hence 
T(P) D> co(K ) and co( K ) must be compact. 

Case 2: & is arbitrary. Let {x,,} be a sequence in co(K ). So for each n 
there is a finite subset F, of K such that x, © co(F,). Let F = U%_,F, and 
let #4 =VF. Then K, = KN .-Z@ is weakly compact and {x,} e oe ). 
Since @ is separable, Case 1 implies that co( K,) is weakly compact. By the 
Eberlein—Smulian Theorem, there is a subsequence {x, } and an x in 
co( K, ) < co(K) such that X,, 2 x. Thus co(K ) is weakly compact. fz 


EXERCISES 
1. Prove Corollary 13.2. 


2. If & is a Banach space and K is a compact subset of %, prove that co( K) is 
compact. 


3. In the proof of (13.4), if A = the probability measures on K, show that T(P) 
= co( K ). 


4. Prove the Eberlein—Smulian Theorem in the setting of Hilbert space. 


CHAPTER VI 


Linear Operators on a Banach Space 


As has been said before in this book, the theory of bounded linear operators 
on a Banach space has seen relatively little activity owing to the difficult 
geometric problems inherent in the concept of a Banach space. In this 
chapter several of the general concepts of this theory are presented. When 
combined with the few results from the next chapter, they constitute 
essentially the whole of the general theory of these operators. 

We begin with a study of the adjoint of a Banach space operator. Unlike 
the adjoint of an operator on a Hilbert space (Section II.2), the adjoint of a 
bounded linear operator on a Banach space does not operate on the space 
but on the dual space. 


§1. The Adjoint of a Linear Operator 


Suppose 2 and ¥Y are vector spaces and 7: 2— ¥Y 1s a linear transforma- 
tion. Let &’ = all of the linear functionals of Y> F. If y’e Y’, then 
y’oT: 2-— F 1s easily seen to be a linear functional on 2%. That 1s, 
y’oTE€ &”’. This defines a map 


LOE = Ge 


by T’(y’) = y’eT. The first result shows that if 2 and Y are Banach 
spaces, then the map 7’ can be used to determine when T is bounded. 
Another equivalent formulation of boundedness is given by means of the 
weak topology. 
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1.1. Theorem. Jf 2 and Y are Banach spaces and T: &- ¥Y is a linear 
transformation, then the following statements are equivalent. 


(a) T is bounded. 
(b) T(Y*)CB*. 
(c) T: (4, weak) — (Y, weak) is continuous. 


PROOF. (a) => (b): If y* © Y*, then T’(y*) € @’; it must be shown that 
T'(y*) © €*. But |T(y*\x)| = y*eT(x)| = (T(x), y*)| < 
TCI y*l < [TI *l all. So Ty) & B*. 

(b) = (c): If {x;} is a net in 2 and x, — 0 weakly, then for y* in Y*, 
(T(x;), y*) = T’(y*\(x;) 2 0 since T’y*) Ee 2*. Hence T(x;) > 0 
weakly in &. 

(c) = (b): If y* © Y*, then y*o7T: &- F is weakly continuous by (c). 
Hence T’(y*) = p*oTEG2@* by (V.1.2). 

(b) = (a): Let y* © Y* and put x* = T’(y*). So x* € &* by (b). So 
if x © ball 2, |(T(x), y*)| = |x, x*)| < ||x*||. That is, sup{|(7(x), y*)|: 
x € ball 2} < o. Hence T(ball 2) is weakly bounded; by the PUB, 
T(ball 2) 1s norm bounded and so ||T|| < oo. a 


The preceding result is useful, though strictly speaking it is not necessary 
for the purpose of defining the adjoint of an operator A in @( 2%, ¥Y), which 
we now turn to. If Ad &€ @(%,¥Y) and y* € Y*, then y*¥oA = A y*)E 
4 *, This defines a map A*: ¥* > #*, where A* = A’|Y*. Hence 


1.2 (x, A*(y*)) = (A(x), y*) 


for x in & and y* in Y*. A®* is called the adjoint of A. 

Before exploring the concept let’s see how this compares with the defini- 
tion of the adjoint of an operator on Hilbert space given in § II.2. There is a 
difference, but only a small one. When # is identified with #*, the dual 
space of #, the identification is not linear but conjugate linear (if F = C). 
The isometry h L, of # onto #*, where L,(f)= (f,h), satisfies 
Lu, = aL,. Thus the definition of A* given in (1.2) above is not the same as 
the adjoint of an operator on Hilbert space, since in (1.2) A* is defined on 
@#* and not some conjugate-linear isomorphic image of it. In particular, if 
the definition (1.2) is applied to a matrix A acting on C% considered as a 
Banach space, its adjoint corresponds to the transpose of A. If C% is 
considered as a Hilbert space, then the matrix of A* is the conjugate 
transpose of the matrix of A. This difference will not confuse us but it will 
serve to explain minor differences that will appear in the treatment of the 
two types of adjoints. The first of these occurs in the next result. 


1.3. Proposition. If 2 and Y are Banach spaces, A,B € B(X,Y), and 
a,B € F, then (aA + BB)* = aA* + BB*. 
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Note the absence of conjugates. The proof is left to the reader. 

If A &€ B(Z,Y), then it is easy to see that A* € @(Y*, &*). In fact, if 
y* € ball Y* and x € ball %, then |(x, A*y*)| = |( Ax, y*)| < ||Ax|| < 
||A |]. Hence ||A*y*]] < ||A]| if y* © ball&*, so that ||A*|| < ||Al|. This 
implies that (A*)* = A** can be defined, 


Att TT ** we Y ** | 
(Att y= (x**, Aty™ 
for x** in ** and y* in Y*. 


Suppose x © @ and consider x as an element of 2** via the natural 
embedding of % into its double dual. What is A**(x)? For y* in Y*, 


(A**(x), y*) = (x, A*y*) 
= (Ax, y*). 
That is, A**|%= A. This is the first part of the next proposition. 


1.4. Proposition. [f 2 and Y are Banach spaces and A © B(2,Y), then: 


(a) A**|2= A; 

(b) |A*|| = |All 

(c) if A is invertible, then A* is invertible and (A*)~' = (A7~')*; 
(d) if & is a Banach space and B © BY, Z), then (BA)* = A*B*. 


PRooF. Part (a) was proved above. It was also shown that ||A*|| < ||A]j. 
Thus ||A**|| < ||A*||. So if x © ball 2%, then (a) implies that ||Ax|| = 
||A**x|| < ||A**|| < ||A*|]. Hence ||A]] < ||A*]]. 

The remainder of the proof is left to the reader. sl 


1.5. Example. Let (X, 92,4) and M,: L?(u) > L?(p) be as in Example 
1.2.2. If 1 <p < o and 1/p+1/q=1, then MS: L%p) > Lp) Is 
given by M¥f = of. That is, MX = Mg. 

1.6. Example. Let K and k be as in Example III.2.3. If 1 < p < oo and 
1/p + 1/q = 1, then K*: L%(p) > L%p) is the integral operator with 
kernel k*(x, y) = k(y, x). 


1.7. Example. Let X, Y, 7, and A be as in Example III.2.4. Then A”: 
M(Y) > M(X) is given by 
(A*u)(A) = w(r-*(4)) 
for every Borel subset A of X and every p in M(Y). 
Compare (1.5) and (1.6) with (II.2.8) and (II.2.9) to see the contrast 


between the adjoint of an operator on Banach space with the adjoint of a 
Hilbert space operator. 


1.8. Proposition. [f A = B(%,¥Y), then ker A* = (ran A)*+ and kerA = 
* (ran A*). 
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The proof of this useful result is similar to that of Proposition II.2.19 and 
is left to the reader. 
This enables us to prove the converse of Proposition 1.4c. 


1.9. Proposition. If A <= @(%,Y), then A is invertible if and only if A* is 
invertible. 


PRooF. In light of (1.4c) it suffices to assume that A* is invertible and show 
that A is invertible. By the Open Mapping Theorem, there is a constant 
c > 0 such that A*(ball Y*) D (x* © H*: ||x*|| < c}. Soif x € Z, then 


|| Ax|| = sup{|{ Ax, y*)|: y* © ball Y* } 
= sup{|(x, A*y*)|: y* © ball Y*} 
> sup{|(x,x*)|:x* ©X* and ||x*|| <c} 
= ¢||x]]. 
Thus ker A = (0) and ran A is closed. (Why?) On the other hand, (ran A) + 


= ker A* = (0) since A* 1s invertible. Thus ran A 1s also dense. This implies 
that A is surjective and thus invertible. bia 


This section concludes with the following useful result that seems to be 
somewhat unfamiliar to parts of the mathematical community. 


1.10. Theorem. Jf 2 and Y are Banach spaces and A © B(%,Y), then the 


following statements are equivalent. 


(a) ran A is closed. 
(b) ran A* is weak* closed. 
(c) ran A* is norm closed. 


PROOF. It is clear that (b) implies (c), so it will be shown that (a) implies (b) 
and (c) implies (a). Before this is done, it will be shown that it suffices to 
prove the theorem under the additional hypothesis that A is injective and 
has dense range. 

Let 2=cl(ran A). Thus A: 2— Z& induces a bounded linear map B: 
U/kerA > & defined by B(x + kerA) = Ax. If QO: F- B/kerA is the 
natural map, the diagram 


a Fray 


ON 7 B 
4/ker A 


commutes. (Why is B bounded?) It is easy to see that B is injective and that 
B has dense range. In fact, ran B = ran A, so ran A is closed if and only if 
ran B is closed. Let’s examine B*: 2#* — (2%/kerA)*. By (V.2.2), 
(%/ker A)* = (ker A) * = wk*cl(ran A*) C #* by (1.8). Also by (V.2.3), 
since P< Y, F* = Y*/F+t= Y*/(ran A)+ = Y*/ker A* by (1.8). Thus, 


B*: Y*/ker A* — (kerA)> . 
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1.11. Claim. B*( y* + ker A*) = A*y* for all y* in Y*. 


To see this, let x € @ and y* € &Y*, Making the appropriate identifica- 
tions as in (V.2.2) and (V.2.3) gives (x + ker A, B*(y* + ker A*)) 
= (B(x + ker A),y* + ker A*) = (Ax,y* + (ran A)~) = (Ax, y*) 
= (x, A*y*) = (x +> (ran A*), A*y*) = (x + ker A, A*y*). Since x was 
arbitrary, (1.11) is established. 

Note that Claim 1.11 implies that ran B* = ran A*. Hence ran A* is 
weak* (resp., norm) closed if and only if ran B* is weak* (resp., norm) 
closed. 

This discussion shows that the theorem is equivalent to the analogous 
theorem in which there is the additional hypothesis that A 1s injective and 
has dense range. It is assumed, therefore, that ker A = (0) and cl(ran A) = &. 

(a) = (b): Since ran A is closed, the additional hypothesis implies that A 
is bijective. By the Inverse Mapping Theorem, A~' € @(Y, #). Hence A* 
is invertible (1.4c). Since A* is invertible, ran A* = 2@* and hence is weak* 
closed. 

(c) = (b): Since ran A is dense in Y, ker A* = (ran A)~ (1.8) = (0). Thus 
A*: Y* — ran A* is a biyection. Since ran A* is norm closed, it is a Banach 
space. By the Inverse Mapping Theorem, there is a constant c > O such that 
|| A*y*|| > clly*|| for all y* in Y*. 

To show that ran A* is weak* closed, the Krein—Smulian Theorem 
(V.12.6) will be used. Thus suppose { A*y*} is a net in ran A* with 
|| A*y*|| < 1 such that A*y* > x*o(%*, 2%) for some x* in 2*. Thus 
\|y.*|| < c-* for all y*. By Alaoglu’s Theorem there is a y* in &* such that 
y* => y* 0( Y*,Y). Thus (1.1c), A*y* -g> A*y* o(%*, 4), and so 
x* = A*y* © ran A*. By (V.12.6), ran A* is weak* closed. 

(b) = (a): Since ran A* is weak* closed, ran A* = (ker A)*+ = 2*. Also, 
ker A* = (ran A)~+ = (0) since A has dense range. Thus A* is a bijection 
and is thus invertible. By Proposition 1.9, A is invertible and thus has 
closed range. = 


EXERCISES 

1. Prove Proposition 1.3. 

2. Complete the proof of Proposition 1.4. 
3. Verify the statement made in (1.5). 

4. Verify the statement made in (1.6). 

5. Verify the statement made in (1.7). 


6. Let 1 < p < o and define S: /? > 1” by S(a,, a5, ...) = (0, a, a5, ...). Com- 
pute S*. 
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7. Let A © @(c,) and for n > 1, define e, in cy by e,(n) = 1 and e,(m) = 0 for 
m#n. Put a,,, = (Ae,)(m) for m,n > 1. Prove: (a) M = sup,,L°°_,|a,,,,| < 0; 
(b) for every n, a,,, > 0 as m— oo. Conversely, if {a,,,,: m,n > 1} are scalars 


satisfying (a) and (b), then 
(Ax)(m) = Le nn x(n) 
n=1 


defines a bounded operator A on cy, and ||A|| = M. Find A*. 


8. Let A € @(I') and for n>1 define e, in /' by e,(n)=1, e,(m) = 0 for 
m#n. Put a,,,, = (Ae,)(m) for m,n > 1. Prove: (a) M = sup, _,|a,,,| < 003 
(b) for every m, sup,|q,,,,| < 00. Conversely, if {a,,,: m,n > 1} are scalars 


satisfying (a) and (b), then 
(Af)(1) = Le nn f(m) 
m=1 


defines a bounded operator A on /' and ||A|| = M. Find A*. 


9. (F. F. Bonsall) Let 2 be a Banach space, Z a nonempty set, and u: Z > @. If 
there are positive constants M, and M, such that (i) ||u(z)|| < M, for all z in Z 
and (ii) for every x* in %, sup {| < u(z),x* > |: z © Z} > M,||x*||; then for 
every x in & there is an f in /'(Z) such that (*)x = L{ f(z)u(z): z © Z} and 
M, inf |Ifll,_ < ||x|| < M, inf |I/|,, where the infimum is taken over all f in /'(Z) 


such that (*) holds. (Hint: define T: /'(Z) > & by Tf = X{ f(z)u(z): z € Z}.) 


10. (F. F. Bonsall) Let m be normalized Lebesgue measure on 0D and for |z| < 1 
and |w| = 1 let p,(w) = (1 — |z|?)/|1 — Zw’. So p, is the Poisson kernel. Show 
that if f © L'(m), then there is a sequence {z, } C D and a sequence {A,,} in /' 
such that (*)f = L_oA,, p,,. Moreover, |[fll, = infL¥_,|A,,|, where the infimum 


is taken over all {A,,} in /’ such that (*) holds. (Hint: use Exercise 9.) 


§2*. The Banach—Stone Theorem 


As an application of the adjoint of a linear map, the isometries between 
spaces of the form C(X) and C(Y) will be characterized. Note that if X 
and Y are compact spaces, t: Y — X is continuous map, and Af = for for 
f in C(X), then (III.2.4) A is a bounded linear map and ||A|| = 1. 
Moreover, A is an isometry if and only if 7 is surjective. If A is a surjective 
isometry, then +t must be a homeomorphism. Indeed, suppose A is a 
surjective isometry; it must be shown that 7 is injective. If y), y, € Y and 
Yo # Y;, then there is a g in C(Y) such that g(y,) = 0 and g(y,) = 1. Let 


f © CCX) such that Af = g. Thus f(1t(y)) = (yo) = 0 and f(7()y,)) = 1. 
Hence T( yo) # T( yj). 
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So if 7: Y > X is a homeomorphism and a: Y — F is a continuous 
function, with Ja(y)| = 1, then T: C(X)—> C(Y) defined by (7Tf)(y) = 
a( y)f(rCy)) is a surjective isometry. The next result gives a converse to this. 


2.1. The Banach—Stone Theorem. If X and Y are compact and T: C(X) > 
C(Y ) is a surjective isometry, then there is a homeomorphism +t: Y — X anda 
function a in C(Y) such that |a(y)| = 1 for all y and 


(Tf )(y) = a(y)f(7(y)) 
for all f in C(X) and y in Y. 


PROOF. Consider T*: M(Y) — M(X). Because T is a surjective isometry, 
T* is also. (Verify.) Thus 7* is a weak* homeomorphism of ball M(Y) 
onto ball M(_X) that distributes over convex combinations. Hence (Why?) 


T *(ext|ball M(Y)|) = ext|ball M@(X)]. 


By Theorem V.8.4 this implies that for every y in Y there is a unique T(y) 
in X and a unique scalar a( y) such that |a(y)| = 1 and 


T*(8,) 7 a(y)8,(,)- 


By the uniqueness, a: Y ~ F and rt: Y — X are well-defined functions. 
2.2. Claim. a: Y — F is continuous. 


If { y,} is a net in Y and y, > y, then 6, > 6, weak* in M(Y). Hence 
a(y,)8,.,,) = T*(6,) > T*(8,) = a(y)8,,,) Swe in M(X). In particular, 
a(y,) = (1, T*(6, yy = Gly T*(65, y= al) proving (2.2). 


2.3. Claim. tr: Y — X is a homeomorphism. 


As in the proof of (2.2), if y,> y in Y, then a(y;,)6,,,,) > a(y)6,,,, 
weak* in M(X). Also, a(y,) > a(y) in F by (2.2). Thus 6, = 
a(y;)~ La y,)8,,,)] > 8,4). By (V.6.1) this implies that 7(y,) > T(y), so 
that t: Y > X is continuous. 

If y;,¥. © Y and y, # yy, then a(y,)d, # a(y,)6,. Since T* is injec- 
tive, it is easy to see that r(y,) # T(y,) and so 7 is one-to-one. If x € X, 
then the fact that T* is surjective implies that there is a wp in M(Y) such 
that T*u = 6,. It must be that p © ext[ball M(X)] (Why?), so that up = £6, 
for some y in Y and B inF with |B| = 1. Thus 6, = T*(B8,) = Ba( y)8,,,)- 
Hence B = a(y) and 7+(y)= x. Therefore 7; Y — X is a continuous 
bijection and hence must be a homeomorphism (A.2.8). This establishes 
(2.3). 

If fec(x) and yey, then T(f)(y) = (7f,5,) = (f,T*5,) = 
(fra y)8¢)) = ay) f(y). 
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§3. Compact Operators 


The following definition generalizes the concept of a compact operator from 
a Hilbert space to a Banach space. 


3.1. Definition. If 2 and Y are Banach spaces and A: #—> ¥Y is a linear 
transformation, then A is compact if cl A(ball 2) is compact in Y. 


The reader should become reacquainted with Section II.4. 

It is easy to see that compact operators are bounded. 

For operators on a Hilbert space the following concept is equivalent to 
compactness, as will be seen. 


3.2. Definition. If 2 and Y are Banach spaces and A € @(2%,Y), then A 
is completely continuous if for any sequence {x,}in % such that x, > x 
weakly it follows that ||Ax, — Ax|| — 0. 


3.3. Proposition. Let Z and Y be Banach spaces and let A = B( X,Y). 


(a) If A is a compact operator, then A is completely continuous. 
(b) If & is reflexive and A is completely continuous, then A is compact. 


PROOF. (a) Let {x,,} be a sequence in 2 such that x, — 0 weakly. By the 
PUB, M = sup,]||x,]|| < oo. Without loss of generality, it may be assumed 
that M <1. Hence { Ax,,} € cl A(ball 2). Since A is compact, there is a 
subsequence {x,,} anda y in Y such that ||Ax, — y|| > 0. But x, — 0 
(wk) and A: (2%, wk) > (&, wk) is continuous (1.1c). Hence Ax, > A(Q) = 
0 (wk). Thus y = 0. Since 0 is the unique cluster point of { Ax,} and this 
sequence is contained in a compact set, ||Ax,|| — 0. 

(b) First assume that 2% is separable; so (ball 2, wk) is a compact metric 
space. So if {x,} 18 a sequence in ball 2 there is an x in 2% and a 
subsequence {x, } such that x, —> x weakly. Since A is completely con- 
tinuous, ||Ax,, — “Axl — 0. Thus A(ball 2) is sequentially compact; that 1s, 
Aisa compact operator. 

Now let 2 be arbitrary and let {x,} C ball Z. If 2, = the closed linear 
span of {x,}, then %, is separable and reflexive. If A, = A|%,, then A,: 
#, > ¥Y 1s easily seen to be completely continuous. By the first paragraph, 
A, is compact. Thus { Ax, } = { A,x,,} has a convergent subsequence. Since 
{x,,} was arbitrary, A is a compact operator. | 


The fact that in the proof of (3.3b), A(ball 2) was shown to be compact, 
and hence closed, is a consequence of the reflexivity of A. 

By Proposition V.5.2, every operator in @(/') is completely continuous. 
However, there are noncompact operators in @(/') (for example, the 
identity operator). 


178 VI. Linear Operators on a Banach Space 


There has been relatively little study of completely continuous operators 
that I am aware of. Most of the effort has been devoted to the study of 
compact operators and this is the direction we now pursue. 


3.4. Schauder’s Theorem. Jf A € @(2%,¥Y), then A is compact if and only if 
A*™ is compact. 


PROOF. Assume A 1s a compact operator and let { y,*} be a sequence in 
ball Y *. It must be shown that { A*y,*} has a norm convergent subsequence 
or, equivalently, a cluster point in the norm topology. By Alaoglu’s The- 
orem, there is a y* in ball&* such that y* -3> y* (weak*). It will be 
shown that A*y* —3> A*y* in norm. 

Let e > O and fix N > 1. Because A(ball 2) has compact closure, there 
are vectors y,,..., y,, in Y such that A(ball 7) CUP {ye &: lly — y, || 
< e/3}. Since y,* -3> y* (weak*), there is an n > N such that |¢ y,, 
y* — y*)| < €/3 for 1 < k <m. Let x be an arbitrary element in ball & 
and choose y, such that ||Ax — y,|| < ¢/3. Then 


Koa ye As) | = Any yD 


a KALS yay ya ey a, | 
< 2\|Ax — y,|| + €/3 < «. 


Thus ||A*y — A*y*|| < e. 

For the converse, assume A* is compact. By the first half of the proof, 
A*¥*: B** + Y** is compact. It is easy to check that A = A**|% 1s 
compact. a 


For Banach spaces and Y, &(%,¥Y) denotes the set of all compact 
operators from % into Y; (2) = BF, &). 


3.5. Proposition. Let %, Y, and & be Banach spaces. 


(a) B(%,Y) is a closed linear subspace of B( X,Y). 
(b) [fKEB(%,Y) andA€ BY, 2&), then AK € B(2, Z&). 
(c) fKEB(4%,Y) andA = BZ,2), then KA € B(Z,Y). 


The proof of (3.5) is left as an exercise. 


3.6. Corollary. If 2 is a Banach space, B)( 2 ) is a closed two-sided ideal in 
the algebra B( 2). 


Let @(%,¥Y) = the bounded operators T: 2-— Y for which ranT is 
finite dimensional. Operators in &(%,¥%) are called operators with finite 
rank. It is easy to see that @)(2%,Y) € B(#, Y) and by (3.5a) the closure 
of By (X,Y) is contained in By(%,Y). Is Boy (4%, Y) dense in By( 4, Y)? 
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It was shown in (II.4.4) that if # is a Hilbert space, then @)(#) is 
indeed the closure of #(#). Note that the ability to find an orthonormal 
basis in a Hilbert space played a significant role in the proof of this 
theorem. There is a concept of a basis for a Banach space called a Schauder 
basis. Any Banach space 2 with a Schauder basis has the property that 
Bo (4) is dense in & (2). Enflo [1973] gave an example of a separable 
reflexive Banach space 2 for which #,,(%) is not dense in #(%), and, 
hence, has no Schauder basis. Davie [1973] and [1975] have simplifications 
of Enflo’s proof. For the classical Banach spaces, however, every compact 
operator is the limit of a sequence of finite-rank operators. 

The remainder of this section is devoted to proving that for X compact, 
Boy(C(X)) is dense in 4,(C(X)). This begins with material that may be 
familiar to many readers but will be presented for those who are un- 
acquainted with it. 


3.7. Definition. If X is completely regular and #C C(X), then F is 
equicontinuous if for every e > 0 and for every x, in X there is a neighbor- 
hood U of x, such that |f(x) — f(x,)| < e for all x in U and for all f 
in F. 


Note that for a single function f in C(X), #= { f} is equicontinuous. 
The concept of equicontinuity states that one neighborhood works for all f 
in F. 


3.8. The Arzela—Ascoli Theorem. Jf X is compact and FC C(X), then F¥ is 
totally bounded if and only if F is bounded and equicontinuous. 


PROOF. Suppose F is totally bounded. It is easy to see that FY is bounded. 
If e > 0, then there are f,,...,f, in #” such that AC UZ_i{f © CCX): 
If — fl] < €/3}. If x9 © X, let U be an open neighborhood of x, such that 
forl <k <nand x inJU, |f,(x) — f,(%o)| < €/3. If fe F, let f, be such 
that ||f — f,|| < ¢/3. Then for x in U, 


f(x) —f(xo)l s Ie) ~ AO) + ix) — A (Xo) 
+ If, (Xo) — f(x) 


< €. 


Hence ¥ is equicontinuous. 

Now assume that ¥ is equicontinuous and FC ballC(X). Let « > 0. 
For each x in 2%, let U. be an open neighborhood of x such that 
f(x) — f(y)| < e/2 for fin A and y nU.. Now {U,: x © X} is an open 
covering of X. Since X is compact, there are points x,,...,x, in X such 
that X = =U, . 

Let {a,,...,a,,} CD such that clD C UZ_,{a: |a — a,| < €/2}. Let 
B = all ordered n-tuples of scalars (8,,...,8,) such that {B,,...,8,} C 
{a,,...,a,,}. (So B has m” elements.) Let $,,...,, be a partition of unity 
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subordinate to the cover {U,,...,U, } (V.6.5). For b = (B,...,8,) in B, 
let 8b ~ u' -1B;%;- 


3.9. Claim. FC U,< pf f: || f— gall < e}. 
Note that (3.9) implies that ¥ is totally bounded. 

For f in F, { f(x,),...,f(x,)} € clD. Pick b = (f,,...,8,) in B such 
that |B, — f(x,)| < ¢/2 forl <j <n. If x © X, then d $x) = 1 and so 


(x) ~ (al = 9) ~ © 895) 


Le B,| $,(x) 


ee 


< Evo - Byl6,(x). 


Now if ¢(x) > 0, x © U, and so [f(x) — Bl < If(x) — f(x,)| + I(x) - 
B| < e. Hence |f(x) — g,(x)| < « for all x in X. That is, ||f— g,|| < «. 
|| 


3.10. Corollary. [f X is compact and FC C(X), then F is compact if and 
only if F is closed, bounded, and equicontinuous. 


3.11. Theorem. If X is compact, then By(C(X)) is dense in By(C(X)). 


Proor. Let T € &,(C(X)). Thus T(ball C(X)) is bounded and equicon- 
tinuous by the Arzela—Ascoli Theorem. If e > 0 and x € X, let U. be an 
open neighborhood of x such that |(7f)(x) — (7f)(y)| < e for all f in 
ballC(X) and y in U,. Let {x,,...,x,} © X such that X C U%_U_,. Let 
{o,,.--,, } be a partition of unity subordinate 105 Oh 2050. fs Define I ie 
C(X) > C(X) by 


Lf= YL (T\(x)o, 


Since ran7T. C V{9,,...,6,}, T, © Bo (C(*X)). 
If f € ball CCX) and x © X, then 


(T.f)(x) -(TA)(x)) (TF )(x,) —(TA)(x)] (9) 


A 


< LUM) -(T)1e,(2) 


by an argument like the one used to prove (3.9). = 


VI.3. Compact Operators 181 


If X is locally compact, then the operators on C)(X) of finite rank are 


dense in &,(C)( X)). See Exercise 18. 


EXERCISES 


1 
2 
3 
4. 
5 
6 


10. 


ll. 


12: 


13. 
14. 
15. 


16. 


. If & is reflexive and A € @(%,Y), show that A(ball %) is closed in Y. 
. Prove Proposition 3.5. 


_ If A ©€ &(4,Y), show that cl[ran A] is separable. 


If A € B(&%,Y) and ran A is closed, show that ran A is finite dimensional. 


. If A € & (2) and A is invertible, show that dim 2°< oo. 
. Let (X, 2, w) be a finite measure space, 1 < p < 00, and1/p+1/g=1.If k: 


X X X > F is an 22 X 92-measurable function such that sup{ {|A(x, y)|7 du(y): 
x © X} < oo, then (Kf)(x) = {k(x, y) f(y) du(y) defines a compact operator 
on L?(). 


. Let (X, 2, ») be an arbitrary measure space, 1 < p < oo, and1/p + 1/q=1.If 


k: XX X > F is an Q X Q“-measurable function such that M = 


[C/K (x, y)P d(x)? du(y)]'/4 < 00 and if (Kf)(x) = fk(x, y) f(y) du(y), 
then K € &(L?(p)) and ||K|| < M. 


. Let X be a compact space and let p be a positive Borel measure on X. Let 


T © B(L?(p), C(X)) where 1 < p < oo. Show that if A: L?(p) > L?(p) is 
defined by Af = Tf, then A is compact. 


. (B. J. Pettis) If % is reflexive and T € @(#, I"), then T is a compact operator. 


Also, if Y is reflexive and T € @(cy, Y), T is compact. 


If X is compact and {f,,...,f,,815---,8,}€ C(X), define k(x, y)= 
vii f;(x)g;(y) for x, y © X. Let » be a regular Borel measure on X and put 
Kf(x) = fk(x, y) f(y) du(y). Show that K € @(C(X)) and K has finite rank. 


If X is compact, k € C(X X X), and wp is a regular Borel measure on X, show 
that Kf(x) = fk(x, y) f(y) du(y) defines a compact operator on C(X). 


Let (X,0,u) be a o-finite measure space and for ¢ in L*(p) let M;: 
L?(p) > L?(p) be the multiplication operator defined in Example III.2.2. Give 
necessary and sufficient conditions on (X, §2,) and ¢ for M, to be compact. 


Let 7: [0,1] — [0,1] be continuous and define A: C[0,1] ~ C[0,1] by Af = for. 
Give necessary and sufficient conditions on + for A to be compact. 


Let A € &(c,) and let (4a,,,,) be the corresponding matrix as in Exercise 1.7. 
Give necessary and sufficient conditions on (a,,,,) for A to be compact. 


Let A © &(I') and let (a,,,,) be the corresponding matrix as in Exercise 1.8. 
Give a necessary and sufficient condition on (a,,,,) for A to be compact. 


mn 


mn 


If (X,d) is a compact metric space and F C C(X), show that F is equicon- 
tinuous if and only if for every « > 0 there is a 6 > 0 such that |f(x) — f(y)| < e 
whenever d(x, y) < 6 and fe F. 
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17. If X is locally compact and ¥C C)(X), show that F is totally bounded if and 
only if (a) F is bounded; (b) # is equicontinuous; (c) for every e > O there is a 
compact subset K of X such that |f(x)| < e« for all f in A and x in X¥\ K. 


18. If X is locally compact and A € 4)(C(X)), then there is a sequence { A, } of 
finite-rank operators such that ||A, — A]| — 0. 


19. Let 2 be a Banach space and suppose there is a net {F} of finite-rank 
operators on % such that (a) sup,||F;|| < 00; (b) ||R.x — x|| > 0 for all x in 2. 
Show that if A € &(#), then ||F.A — A|| — 0 and hence there is a sequence 
{A,,} of finite-rank operators on % such that ||A,,— A|| — 0. 


20. Let1l < p < o and let (X, 2,1) be a o-finite measure space. If A € B)(L?(p)), 
show that there is a sequence { A, } of finite-rank operators such that ||A,, — A]| 
— (0. (Hint: Use Exercise 19.) 


21. Let X be compact and let WY be the collection of all pairs (C, F) where 
C = {U,,...,U,} is a finite open cover of X and F = {x,,...,x,} C X such 
that x, © U, for l<j<n. If (C,,F,) and (G,F,)€ &, define (C,, Fi) < 
(C,, F,) to mean: (a) C, is a refinement of C,; that is, each member of C, is 
contained in some member of C,. (b) FA, CK. If a=(C,F)eE@ let 
{¢,,..-59,} be a partition of unity subordinate to C. If F = {x,,...,x,}, 
define 7,: C(X) ~ C(X) by 


(Tf )(x) = S f(x;) (x). 
j=l 
Then: (a) T, € Bo (C(X)); (b) ||T, || = 1; (© (&, <) is a directed set and { 7): 
a € WM} is a net; (d) ||T,f — fll ~ 0 for each f. Now apply Exercise 19 to 
obtain a new proof of Theorem 3.11. 


§4. Invariant Subspaces 


4.1. Definition. If 2% is a Banach space and 7 © #(2), an invariant 
subspace for T is a closed linear subspace 4 of 2 such that Tx © .4@ 
whenever x €.4. @ is nontrivial if 7 # (0) or &. Lat T = the collection 
of all invariant subspaces for 7. If “Cc #(2), then Lat w= f\{Lat T: 
TED}. 


This generalizes the corresponding concept of invariant subspace for an 
operator on Hilbert space (11.3.5). Note that the idea of a reducing subspace 
for an operator on a Hilbert space has no generalization to Banach spaces 
since there is no concept of an orthogonal complement in Banach spaces. 


4.2. Proposition. 


(a) If M,,M@,€ LatT, then 4,V M,= lM, +4,) © LatT and 4, 
NM, =M, OM, &€ LatT. 
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(b) If {4 1&1} Cc LatT, then V{.4,: i€ I}, the closed linear span of 
U.4,, and N{M@;:i © 1} =), belong to Lat T. 


The proof of this proposition is left as an exercise. The proposition, 
however, does justify the use of the symbol “Lat” to denote the collection of 
invariant subspaces. With the operations V and A, LatT 1s a lattice (a) 
that is complete (b). Moreover, Lat7 has a largest element, 2%, and a 
smallest element, (0). 

The main question is: does Lat T have any elements besides (0) and 2? 
In other words, does T have a nontrivial invariant subspace? C. J. Read 
[1984] has given an example of a bounded operator on /' that has no 
nontrivial invariant subspaces. This deep work does not completely settle 
the matter. Which Banach spaces 2 have the property that there is a 
bounded operator on 2% with no nontrivial invariant subspaces? If 2 is 
reflexive, is Lat 7 nontrivial for every T in &(2%)? The question is un- 
answered even if % is a Hilbert space. However, for certain specific 
operators and classes of operators it has been shown that the lattice of 
invariant subspaces is not trivial. In this section it will be shown that any 
compact operator has a nontrivial invariant subspace. This will be obtained 
as a corollary of a more general result of V. Lomonosov. But first some 
examples. 


4.3. Example. If % is a finite-dimensional space over C and T€ &(2), 
then Lat 7 is not trivial. In fact, let 2@= C?% and let T = a matrix. Then 
p(z) = det(T — zI) is a polynomial of degree d. Hence it has a zero, say a. 
If det(T — al) = 0, then (T — a/) is not invertible. But in finite-dimen- 
sional spaces this means that 7 — al is not injective. Thus ker(T — al) # 
(O). Let @ < ker(T — al) such that 4 # (0). If x © 4, then Tx = ax © 
M,S0o UM © Lat T. 
4.4. Example. If T = |° on R’, then LatT is trivial. Indeed, if 
Lat7T is not trivial, there is a one-dimensional space 4 in LatT. Let 
M = {ae: a &R}. Since WE LatT, Te = Ae for some A in R. Hence 
T*e = T(Te) = ATe = ’e. But T”? = —I, so —e = Ne and it must be that 
\? = —1 if e # 0. But this cannot be if A is real. 

If d > 3, however, and T © @(R“), then Lat T is not trivial (Exercise 6). 


4.5. Example. If V: L?[0,1] > L*[0,1] is the Volterra operator, Vf(x) = 

of(t) dt, and0 <a<1,put 7, = {fe L’[0,1]: f(t) = Ofor0 <t < @}. 
Then 4, © Lat V. Moreover, it can be shown that LatV = {.7,:0<a< 
1}. (See Donoghue [1957], and Radjavi and Rosenthal [1973], p. 68). 


4.6. Example. If S: /? > /? is defined by S(a,,a,, ...) = (O,a,, a5, ...), 
and @, = {x el’: x(k)=0forl <k <n}, then 4%, € LatS. 
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4.7. Example. Let (X,0,) be a o-finite measure space and for @ in 
L™() let M, denote the multiplication operator on L?(p), 1 < p < oo. If 
AEQ, let W,={fEeL?(p): f=0 ae. [pw] off 4}. Then for each ¢ in 
L*(n), M, € Lat My. 


It is a difficult if not impossible task to determine all the invariant 
subspaces of a specific operator. The Volterra operator and the shift 
operator are examples where all the invariant subspaces have been de- 
termined. But there are multiplication operators M, for which there 1s no 
characterization of Lat M, as well as some M, for which such a characteri- 
zation has been achieved. One such example follows: let ~» = Lebesgue area 
measure on D and let (Af)(z) = zf(z) for f in L*(w). There is no known 
characterization of Lat A. 

It is necessary at this point to return to the geometry of Banach spaces to 
prove the following classical theorem. 


4.8. Mazur’s Theorem. If 2 is a Banach space and K is a compact subset of 
#, then co(K ) is compact. 


PRooF. It suffices to show that co(K) is totally bounded. Let e > 0 and 
choose x,,...,x, in K such that K C U"_,B(x,; 2/4). Put C= 
co{x,,...,X, }. It is easy to see that C is compact. Hence there are vectors 
ctied in such that C CU”, B();; €/4). If w © co( K ), there is a z in 
co(K) with ||w — z2|| < &/4. Tis = vi -10,kK,, where k, © K, a, > 0, 
and La, = 1. Now for each k, there is an x;,,) with ||k, — xj, p)ll < 6/4. 
Therefore 
/ 

Ly &Xj¢p) 
p=l 


1 
zs a,(k, = as) 
p=l 


l 


IA 
i 1 


a,||K = X ic pyll 


< 6/4. 


EC so there is a y, with ||L,a,x,,)— yil| < €74. The 
triangle inequality now shows that co(K ) C U7, B(y,; €) and so co(K ) is 
totally bounded. a 


The next result is from Lomonosov [1973]. When it appeared it caused 
great excitement, both for the strength of its conclusion and for the 
simplicity of its proof. The proof uses Schauder’s Fixed-Point Theorem 
(V.9.5). 


4.9. Lomonosov’s Lemma. If / is a subalgebra of B( 2) such that 1 © 
and Lat #= {(0), 2%} and if K is a nonzero compact operator on X, then 
there is an A in & such that ker(AK — 1) # 0. 
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PROOF. It may be assumed that ||K|| = 1. Fix x) in 2 such that ||Kx || > 1 
and put S = {x © Z: ||x — xo|| < 1}. It is easy to check that 


4.10 0¢Sand0 €clK(S). 


Now if x € & and x # 0, cl{Tx: T © #} is an invariant subspace for ¥ 

(because . is an algebra) that contains the nonzero vector x (because 

1 © ¥#). By hypothesis, cl{Tx: T€ ¥%} = X. By (4.10) this says that for 

every y in cl K(S) there is a T in & with ||Ty — xo|| < 1. Equivalently, 
AK(S)C U {y: IITy — xoll < 1). 


TES 


Because cl K(S') is compact, there are 7,,..., 7, in & such that 


4.11 dAK(S)¢ U {y: IG» — xoll < 1}. 
j=l 
For y in cl K(S) and 1 <j <n, let a,(y) = max{0,1 — ||T,y — xoll}. 
By (4.11), X_,4,(y) > 0 for all y in cl K(S). Define 5: cl K(S) > R by 


z a,(y) 


and define y: S > & by 


v(x) = Db Kx)TKx. 
j=l 
It is easy to see that a;: cl K(S) — [0,1] is a continuous function. Hence b; 
and y are continuous. 

If xe S, then Kx © K(S). If b(Kx)> 0, then a,(Kx)>0 and so 
|| F;Kx — xol| < 1. That is, 7;Kx € S whenever b,( Kx) > 0. Since S is a 
convex set and 1” _,b,(Kx) = 1 for x in S, 


W(S)CS. 


Note that 7;K © @)(2) for_each j so that U%_,7,;K(S) has compact 
closure. By Mazur’s Theorem, co(U’_,7,K(S)) is compact. But this convex 
set contains ¥(S) so that cl y(S) is compact. That is, y is a compact map. 
By the Schauder Fixed-Point Theorem, there is a vector x, in S such that 
W(X) = x). 

Let B; = b,(Kx,) and put A = 2" _,B7,. So A © ¥ and AKx, = p(x) 
= x,. Since x, # 0 (Why?), ker(AK — 1) # (0). B 


4.12. Definition. If T < 4(Z), then a hyperinvariant subspace for T is a 
subspace “4 of 2 such that AMC.M for every operator A in the 
commutant of 7,{7 }’; that is, AW CM whenever AT = TA. 


Note that every hyperinvariant subspace for T is invariant. 
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4.13. Lomonosov’s Theorem. Jf T <= @(%), T is not a multiple of the 
identity, and TK = KT for some nonzero compact operator K, then T has a 
nontrivial hyperinvariant subspace. 


Proor. Let #%= {T}’. We want to show that Lat ~# {(0), %}. If this is 
not the case, then Lomonosov’s Lemma implies that there is an operator A 
in & such that W= ker(AK — 1) # (0). But We Lat(AK) and AK|” is 
the identity operator. Since AK € (2%), AK|VW€ B&B (NV). Thus dim VW 
< oo. Since AK € W= {T}’, for any x in YW, AK(Tx) = T(AKx) = Tx; 
hence TV C MM. But dim W< oo so that T|W must have an eigenvalue 4X. 
Thus ker(T — A) = @# (0). But #4+# Z since T is not a multiple of the 
identity. It is easy to check that -@ is hyperinvariant for T. Es 


4.14. Corollary. (Aronszajn-Smith [1954].). If K € (2), then Lat K is 
nontrivial. 


The next result appeared in Bernstein and Robinson [1966], where it 1s 
proved using nonstandard analysis. Halmos [1966] gave a proof using 
standard analysis. Now it is an easy consequence of Lomonosov’s Theorem. 


4.15. Corollary. If 2 is infinite dimensional, A © @(2&), and there is a 
polynomial in one variable, p, such that p(A) © &)(2), then LatA is 
nontrivial. 


PRooF. If p(A) # 0, then Lomonosov’s Theorem applies. If p(A) = 0, let 
p(z) = @ + az + ++: +a,z", a, #0. For x #0, let @= 
V{x, Ax,..., A" 1x}. Since A” = —a,"[ag + ajA + --- +a,_,A” ‘x], 
M © Lat A. Since x € YW, M# (0); since dim 4 < 0, WHE. a 


4.16. Corollary. Jf K,,K, <= &@(2#) and K,K,=K,K,, then K, and K, 
have a common nontrivial invariant subspace. 


EXERCISES 
1. Let A, B, T € @(&) such that TA = BT. Show that graph (7) € Lat(A © B). 


2. Prove that 4 € Lat T if and only if 4+ © Lat T*. What does the map 47> 4* 
of Lat T into Lat T* do to the lattice operations? 


3. Let {e,,e,,e;} be the usual basis for F* and let a,,a,,a, € F. Define T: 
F? + F° by Te, =aj,e,,1 <j < 3. (a) If a,,a), a; are all distinct, show that 
M «© Lat T if and only if 4 = VE, where E C {e,, e3,e3;}. (b) If a, = a, # ay, 
show that 4 € LatT if and only if Z=WV+ 2, where W< V{e,,e,} and 


L< {ae,;, a € F}. 
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4. Generalize Exercise 3 by characterizing Lat T, where T is defined by Te, = a,e,, 
1 <j <d, for any choice of scalars a,,...,a@, and where {e,,...,e,} is the 


usual basis for F 2. 


5. Let {e),..., 4} be the usual basis for F%, let {a,,...,a,_,} CF. If Te, =a,e,,, 
forl <j < d—1 and Te, = 0, find Lat T. 


6. If T © BR“) and d > 3, show that T has a nontrivial invariant subspace. 


7. Show that if T€ @(#) and Z& is not separable, then T has a nontrivial 
invariant subspace. 


8. Give an example of an invertible operator T on a Banach space 2 and an 
invariant subspace 4 for T such that 4 is not invariant for T~!. 


9. Let K € @)(#) and show that if @ is a maximal chain in Lat K, then @ is a 
maximal chain in the lattice of all subspaces of 2. 


§5. Weakly Compact Operators 


5.1. Definition. If 2 and Y are Banach spaces, an operator T in @(%,Y) 
is weakly compact if the closure of T(ball 2) is weakly compact. 


Weakly compact operators are generalizations of compact operators, but 
the hypothesis is not sufficiently strong to yield good information about 
their structure. 

Recall that in a reflexive Banach space the weak closure of any bounded 
set is weakly compact. Also, a bounded operator T: &—> Y is continuous if 
both 2 and Y have their weak topologies (1.1). With these facts in mind, 
the proof of the next result becomes an easy exercise for the reader. 


5.2. Proposition. 


(a) If either X or ¥ is reflexive, then every operator in @(%,Y) is weakly 
compact. 

(b) If T: 2— Y is weakly compact and A © B(Y, &), then AT is weakly 
compact. 

(c) If T: &— Y is weakly compact and B © B(#, 2), then TB is weakly 
compact. 


This proposition shows that assuming that an operator is weakly compact 
is not that strong an assumption. For example, if % is reflexive, every 
operator in &@(2) 1s weakly compact. In particular, every operator on a 
Hilbert space is weakly compact. So any theorem about weakly compact 
operators is a theorem about all operators on a reflexive space. 

In fact, there is a degree of validity for the converse of this statement. In a 
certain sense, theorems about operators on reflexive spaces are also theo- 
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rems about weakly compact operators. The precise meaning of this state- 
ment is the content of Theorem 5.4 below. But before we begin to prove 
this, a lemma is needed. 

Let Y be a Banach space and let W be a bounded convex balanced 
subset of Y. For n > 1 put U, = 2".W +27 “int[ball Y]. Let p, = the gauge 
of U, (IV.1.14). Because U, > 2~"int[ball Y], it is easy to check that p, is a 
norm on &. In fact, p, and || - || are equivalent norms. To see this note that 
if |ly|| < 1, then2~”y € U, so that p,(y) < 2”. Hence p,(y) < 2"||y|]. Also, 
because W is bounded, U, must be bounded; let M > sup{||y||: y € U,}. 
So if p,(y) <1, |lyll| <M. Thus ||y|| < Mp,(y), and || - || and p, are 
equivalent norms. 


5.3. Lemma. For a Banach space Y let W, U,, and p,, be as above. Let 
® = the set of all y in Y such that |\\y||| = [Z%_,p,(y)7]'7 < 0c. Then 


(a) WC {y: liblll < 1); 

(b) (&, ||| - |||) is a@ Banach space and the inclusion map A: R-> ¥ is 
continuous; 

(c) A**: B** + Y** js injective and (A**) (Y) = &; 

(d) & is reflexive if and only if clW is weakly compact. 


PrRoorF. (a) If w € W, then 2”w © U,. Hence 1 > p,(2"w) = 2"p,(w), so 
p,(w) < 27". Thus |||]? < £,(27")? <1. 

(b) Let Y, = Y with the norm p, and put T= ©,Y, (III.4.4). Define ®: 
R->* by ®(y)=(), y, ...). It is easy to see that ® is an isometry, 
though it is clearly not surjective. In fact, ran® = {(y,) & 2%: y, =y,, for 
all n,m}. Thus # is a Banach space. Let P, = the projection of 2% onto the 
first coordinate. Then A = P,° ® and hence A is continuous. 

(c) With the notation from the proof of (b), it follows that 7 ** = @,Y** 
and @**: Q** +» X** is given by O**( y**) = (A**y**, A**y**, ...). 
Now the fact that ® is an isometry implies that ®* is surjective. (This 
follows in two ways. One is by a direct argument (see Exercise 2). Also, 
ran®* is closed since ran® is closed (1.10), and ran®* is dense since 
+ (ran®*) = ker ® = (0).) Hence ker ®** = (ran@*)~ = (0); that is, &** 
is injective. Therefore A** is injective. 

Now let y** € A**~1(Y). It follows that ®**y** = x € #. Let { y,} be 
a net in & such that |ly,|| < ||y**|| for all i and y, > y** o0( 2**, B*) 
(V.4.1). Thus ®**(y,) > B**(y**) o( &**, X*). But OF *(y,) = O(y,) EF 
and ®**( y**) = x. Hence ®(y,) > x 0( %,%*). Since ran® is closed, 
x © ran®; let ®(y) = x. Then 0 = ©**( y** — y). Since @** is injective, 
yp**¥ =ypER4. 

(d) An argument using Alaoglu’s Theorem shows that A**(ball 2**) = 
the o(Y **, Y*) closure of A(ball #). Put C = A(ball #). Suppose clW is 
weakly compact. Now CC 2"c1W + 27~"ball Y¥** and this set is 
a(Y**,Y*) compact. From the preceding paragraph, A**(ball 2**) C 
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2"clIW + 2° "ball Y **. Thus, 


A**(ball B**) © () [2"clW + 2>-"ball Y**] 


n=l1 


IN 


C1) [W+ 2-"ball Y **] 
n=l] 


= Y, 


By (c), A** =A and & 1s reflexive. 
Now assume & is reflexive; thus ball # is o(&, #*)-compact. Therefore 
C = A(ball 2) is weakly compact in Y. By (a), clW is weakly compact. 
a 


The next theorem, as well as the preceding lemma, are from Davis, Figel, 
Johnson, and Pelczynski [1974]. 


5.4. Theorem. If 2,Y are Banach spaces and T © B(%,Y), then T is 
weakly compact if and only if there is a reflexive space @ and operators A in 
B(AR,Y) and Bin B(X, B®) such that T = AB. 


PROOF. If J = AB, where A, B have the described form, then T is weakly 
compact by Proposition 5.2. 

Now assume that T is weakly compact and put W = T(ball 2). Define # 
as in Lemma 5.3. By (5.3d), & is reflexive. Let A: # > Y be the inclusion 
map. Note that if x © ball@, then Tx © W. Hence 2”Tx € U, and so 
1 > p,(2"Tx) = 2"p,(Tx). Thus p,(Tx) < 2~" for x in ball &. Hence if 
lIx|] <1, [I] Tx|Il* = 2, p,(Tx)? < 47>" = c. So B: ¥> Z defined by Bx = 
Tx is a bounded operator. Clearly AB = T. a 


The preceding result can be used to prove several standard results from 
antiquity. 


5.5. Theorem. [f %,¥Y are Banach spaces and T = @(2%,Y), the following 
statements are equivalent. 


(a) T is weakly compact. 
(b) Fe*( 2 8") Coy, 
(c) 7* is weakly compact. 


PROOF. (a) = (b): Let @& be a reflexive space, A € @(F,Y), and BE 
B(*#,#) such that T= AB. So T** = A**B**, But A**: Bo Y** 
since Z** = Y. Hence A** = A. Thus T** = AB**, and so ranT** C 
ran A Cc Y. 

(b) = (a): T **(ball **) is o( Y**, Y*) compact by Alaoglu’s Theorem 
and the weak* continuity of T**. By (b), T **(ball & **) = C is o( Y, Y*) 
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compact in Y. Hence T(ball 2%) CC and must have weakly compact 
closure. 

(c) => (a): Let Y be a reflexive space, CE @(Y*, Sf), DEBS, X*) 
such that T* = DC. So T** = C*¥D*, D*: &** —> F*, and C*: S* > 
Yy** Put A=cl D*(#) and B= D*|%; then B: Z->G& and Z is 
reflexive. Let A = C*|R; so A: R> YW**, But if x EX, ABx = C*D*x 
= T**x = Tx € Y. Thus A: 2- Y. Clearly AB = T. 

(a) = (c): Exercise. hg 


EXERCISES 
1. Prove Proposition 5.2. 


2. If @ and & are Banach spaces and ®: # — & is an isometry, give an elementary 
proof that ®* is surjective. 


3. Let 2 be a Banach space and recall the definition of a weakly Cauchy sequence 
(V.4.4). (a) Show that every bounded sequence in cy has a weakly Cauchy 
subsequence, but not every weakly Cauchy sequence in cg converges. (b) Show 
that if T © @(c,) and T is weakly compact, then T is compact. 


4. Say that a Banach space 2 is weakly compactly generated (WCG) if there is a 
weakly compact subset K of % such that 2 is the closed linear span of K. Prove 
(Davis, Figel, Johnson, and Pelczynski, [1974]) that 2 is WCG if and only if 
there is a reflexive space and an injective bounded operator T: # — & such that 
ranT is dense. (Hint: The Krein-Smulian Theorem (V.13.4) may be useful.) 


5. If (X,Q2,p) is a finite-measure space, k © L*(X X X,92 X Q,p X pw), and K: 
Li(n) > L'(w) is defined by (Kf)(x) = [k(x, y) f(y) du(y), show that K is 
weakly compact and K* is compact. 


6. Let Y be a weakly sequentially complete Banach space. That is, if { y,} is a 
sequence in Y such that {(y,, y*)} is a Cauchy sequence in F for every y* in 
Yy*, then there is a y in Y such that y, > y weakly [see (V.4.4)]. (a) If 
Te B#,Y) and x** © #** such that x** is the o(#**,%*) limit of a 
sequence from 2, show that T**(x**) € Y. Let X be a compact space and put 
F = all subsets of X that are the union of a countable number of compact G; 
sets. Let &= the linear span of {x;-: F © ¥F} considered as a subset of 
M(X)* = C(X)**. (b) Show that if T € @(C(X),Y), then T**(L) CY. (c) 
(Grothendieck [1953].) If T€ @(C(X),Y), then T is weakly compact. [Hint 
(Spain [1976]): Use James’s Theorem [(V.13.3)]. 


CHAPTER VII 


Banach Algebras and Spectral Theory for 
Operators on a Banach Space 


The theory of Banach algebras is a large area in functional analysis with 
several subdivisions and applications to diverse areas of analysis and the 
rest of mathematics. Some monographs on this subject are by Bonsall and 
Duncan [1973] and C. R. Rickart [1960]. 

A significant change occurs in this chapter that will affect the remainder 
of this book. In order to prove that the spectrum of an element of a Banach 
algebra is nonvoid (Section 3), it is necessary to assume that the underlying 
field of scalars F is the field of complex numbers C. It will be assumed from 
Section 3 until the end of this book that all vector spaces are over C. This 
will also enable us to apply the theory of analytic functions to the study of 
Banach algebras and linear operators. 

In this chapter only the rudiments of this subject are discussed. Enough, 
however, is presented to allow a treatment of the basics of spectral theory 
for operators on a Banach space. 


§1. Elementary Properties and Examples 


An algebra over F is a vector space . over F that also has a multiplication 
defined on it that makes into a ring such that if a © F and a,beE a”, 
a(ab) = (aa)b = a(ab). 


1.1. Definition. A Banach algebra is an algebra . over F that has a norm 
| - || relative to which # is a Banach space and such that for all a, b in SW, 
1.2 |]ab|| < |lal| |]4l). 

If . has an identity, e, then it is assumed that |je|| = 1. 
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The fact that (1.2) is satisfied is not essential. If 7 is an algebra and has a 
norm relative to which sx is a Banach space and is such that the map of 
AX @L— WY defined by (a, b) > ab is continuous, then there is an equiv- 
alent norm on that satisfies (1.2) (Exercise 1). 

If 7 has an identity e, then the map a +> ae is an isomorphism of F into 
L and ||ae|| = |a|. So it will be assumed that F C .» via this identification. 
Thus the identity will be denoted by 1. 

The content of the next proposition is that if . does not have an identity, 
it is possible to find a Banach algebra »/, that contains »/, that has an 
identity, and is such that dim #7, /7= 1. 


1.3. Proposition. Jf < is a Banach algebra without an identity, let ¥, = 
WX F. Define algebraic operations on x, by 


(i) (a,a) + (b,B) = (a+ b,a + B); 
(ii) B(a, a) = (Ba, Ba); 
(iii) (a,a)(b, B) = (ab + ab + Ba, a). 


Define ||(a, a)|| = |la|| + Ja]. Then x, with this norm and the algebraic 
operations defined in (i), (ii), and (iii) is a Banach algebra with identity (0, 1) 
and a > (a,0) is an isometric isomorphism of & into %,. 


ProoF. Only (1.2) will be verified here; the remaining details are left to 
the reader. If (a,a),(b,B) € #H,, then ||(a,a)(b, B)|| = ||(ab + Ba + ab, 
aB)|| = \lab + Ba + abl| + joB| < |fal| Ol] + [Bl {lal| + lal [dl] + lal |B) = 
Ca, a) (5, BDI. 


1.4. Example. If X is a compact space, then «= C(X) is a Banach 
algebra if ( fg)(x) = f(x)g(x) whenever f, g € Y and x € X. Note that 7 
is abelian and has an identity (the constantly 1 function). 


If X is completely regular and ~”= C,(X), then & is also a Banach 
algebra. In fact, C,(X) = C(BX) (V.6) so that this is a special case of 
Example 1.4. Another special case is /®. 


1.5. Example. If X is a locally compact space, “= C)(X) is a Banach 
algebra when the multiplication is defined pointwise as in the preceding 
example. . is abelian, but if X is not compact, ~ does not have an 
identity. If X,, is the one-point compactification of X, then C(X,,) > Co( X) 
and C( X,,) is a Banach algebra with identity. 


Note that cy is a special case of Example 1.5. 
1.6. Example. If (X, (2, 1) is a o-finite measure space and Y= L*(X, 2, w), 


then .% is an abelian Banach algebra with identity if the operations are 
defined pointwise. 
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1.7. Example. Let 2 be a Banach space and put ~= 4(2). If multipli- 
cation is defined by composition, then . is a Banach algebra with identity, 
1. If dim % > 2, o& is not abelian. 


1.8. Example. If 2 is a Banach space and w= 4,(%), the compact 
operators on @, then © is a Banach algebra without identity if dim 2 = oo. 
In fact, B(% ) is an ideal of B( 2X). 


Note that a special case of Example 1.7 occurs when ~= M_(F), the 
n Xn matrices, where © is given the norm resulting when M_,(F) is 
identified with @(F”). 


1.9. Example. Let G be a locally compact topological group and let 
M(G) = all finite regular Borel measures on G. If wu,» € M(G), define L: 
Co(G) > F by 


L(f) =f [fCy) de(x) do(y) = f [f(y) ao(y) du). 


Then L is a linear functional on C,(G) and 


LOA < f ffx) alul(x) divi(y) 


< |All Mell lll. 
So L € C,(G)* = M(G). Define p*v by L(f) = ffdu*v for f in C,(G). 
That is, 


1.10 [fduxy =f [f(y) dp(x) do(y). 


Note that ||y *v|| = ||ZI| < |||] ||v||. It follows that M(G) is a Banach 
algebra with this definition of multiplication. The product p * pv is called the 
convolution of p and ». 

Let e = the identity of G and let 6, = the unit point mass at e. If 
f € C,(G), then 


[faux 8. = f [f(y) du(x) 48,(y) 
= { f(xe) du(x) 


= { fap. 


So uw * 6, = p; similarly, 6, * w = uw. Hence 6, is the identity for M(G). 
If x, y © G, then it is easy to check that 6.*6,=6,, and M(G) is 
abelian if and only if G is abelian. 


1.11. Example. Let G be a locally compact group and let m = right Haar 
measure on G. That is, m is a non-negative regular Borel measure on G such 
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that m(U) > 0 for every nonempty open subset U of G and /f(xy) dm(x) 
= {f(x)dm(x) for every f in C.(G) (the continuous functions f: G > F 
with compact support). If G is compact, the existence of m was established 
in Section V.11. If G is not compact, m exists but its existence must be 
established by nonfunctional analytic methods (see Nachbin [1965]). 


If f,g © L'(m), let p = fm and v = gm as in the proof of (V.8.1). Then 
u,v © M(G) and ||u|| = |Ifll,, |r|] = Ilgl|,. In fact, the Radon—Nikodym 
Theorem makes it possible to identify L'(m) with a closed subspace of 
M(G). Is it a closed subalgebra? 

Let ¢ € CG). Then 


foauxv =f fo(xy)f(x)a(») dm(x) dm(y) 
= faly)| foo) fx) am(x)| dm) 
= fa(v)| foso9-) amt) dm(y) 
= f6()| {r-Ya() dm») | ant x) 


= [o(x)h(x) dm(x), 


where h(x) = {f(xy ')g(y)dm(y), x in G. It follows that h € L'(m) (see 
Exercise 4). Thus p * v = hm, so L\(m) is a Banach subalgebra of M(G). In 
fact, the preceding discussion enables us to define f * g in L'(m) for f, g in 
L'(m) by 


f* a(x) = [f(xy ")a(y) am(y). 


The algebra L1(m) is denoted by L(G). 

It can be shown that L(G) is abelian if and only if G is abelian and 
L(G) has an identity if and only if G is discrete (in which case L'(G) = 
M(G)—what is m?). This algebra is examined more closely in Section 9. 

If {.,} is a collection of Banach algebras, let ®).%, = {a € [],.%;: for 
all e > 0, {i: ||a(i)|| = e} is finite}. 


1.12. Proposition. If {.9,} is a collection of Banach algebras, ®,.%, and 
® ., @, are Banach algebras. 


PRooF. Exercise. 


EXERCISES 


1. Let be an algebra that is also a Banach space and such that if a © w, the 
maps x > ax and x xa of MW HM are continuous. Let WY, =. #%%X F as in 
Proposition 1.3. If a €#, define L,: Y% > WY, by L,(x,§) = (ax + €a,0). 
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Show that L, € @(#,) and if |jla||| = ||Z,||, then ||| - ||| 1s equivalent to the 
norm of & and & with ||j - ||| is a Banach algebra. 


2. Complete the proof of Proposition 1.3. 
3. Verify the statements made in Examples (1.4) through (1.9) and (1.11). 


4. Let G be a locally compact group. (a) If ¢ © C.(G) and ¢ > 0, show that there is 
an open neighborhood U of e in G such that ||@, — $,|| < ¢ whenever xy * € U. 
[Here $,(z) = (xz).] (b) Show that if f € L?(G), 1 < p < co, and e« > 0, there 
is an open neighborhood U of e in G such that ||f. — fll, < « whenever 
xy ' © U. (c) Show that if fe L(G) and g © L*(G), h(x) = 
{f(xy ')g(y) dm(y) defines a bounded continuous function h: G —> F. (d) If 
f,g © L(G) and h is defined as in (c), show that h € L'(G). 


5. Prove Proposition 1.12. 


6. Let {.,: i © J} be a collection of Banach algebras. (a) Show that ©,.7, is a 
closed ideal of ©, .W,. (b) Show that ®, ., has an identity if and only if each 
%, has an identity. (c) Show that ®,.%, has an identity if and only if J is finite 
and each », has an identity. 


7. If X,Y are completely regular, show that C,(X) ®,C,(Y) is isometrically 
isomorphic to C,(X ® Y), where X ® Y is the disjoint union of X and Y. 


8. If X and Y are locally compact, show that C)(X) ®,,C)(Y) is isometrically 
isomorphic to Co(X ® Y). 


9. Let {X,;: i€ I} be a collection of locally compact spaces and let X = the 
disjoint union of these spaces furnished with the topology {UC X: UN X, is 
open in X, for all i}. Show that X is locally compact and ©,C)(X;) is 
isometrically isomorphic to C)( X). 


§2. Ideals and Quotients 


If & is an algebra, a left ideal of W is a subalgebra 4 of Ww such that 
ax © M whenever a& HH, x © “M.A right ideal of H is a subalgebra 4 
such that xa ©.@ whenever a€ WwW, x €.4. A (bilateral) ideal is a 
subalgebra of . that is both a left ideal and a right ideal. 

If a€ ¥M and x has an identity 1, say that a is left invertible if there is 
an x in & with xa = 1. Similarly, define right invertible and invertible 
elements. If a is invertible and x, y@W# such that xa = 1 = ay, then 
y = ly = (xa)y = x(ay) = x1 = x. So if a is invertible, there is a unique 
element a~' such that aa~! = a~'a = 1. 

If .Z@ is a left ideal in 7, a € .4@, and a is left invertible, then 7 = wv. 
In fact, if xa = 1, then 1 © @ since -@ is a left ideal. Thus for y in ¥, 
y =yl © %. This forms a link between ideals and invertibility. 

In the case of a Banach algebra some bonuses occur due to the interplay 
of the norm and the algebra. The results of this section will be for Banach 
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algebras with an identity. To discuss invertibility this 1s, of course, the only 
feasible setting. For Banach algebras without an identity some analogous 
results can be obtained, however, by a consideration of the algebra obtained 
by adjoining an identity (1.3). The concept of a modular ideal and a 
modular unit can also be employed (see Exercise 6). 

The next proof is based on the geometric series. 


2.1. Lemma. If x is a Banach algebra with identity and x © & such that 
|x — 1|| < 1, then x is invertible. 


Proor. Let y =1— x: so |lyl] =r <1. Since |y”|] < |lyl” =r” (Why), 
Y2_olly"|| < oo. Hence z = L%_,y” converges in V. If z,=1l+y+y? 
+ ar ye, 


z(l—-y)=(ltyt--: ty") -(y ty? tee ty ty) ad a— yt 


But |ly"*1] < r7*1, so y"*! > 0 as n > oo. Hence z(1 — y) = limz,(1 — 
y) = 1. Similarly, (1 — y)z = 1. So (1 — y) is invertible and (1 — y)~' =z 
=Yey" Butl-y=1—-(l1-x)=x. J 


Note that completeness was used to show that Ly” converges. 


2.2. Theorem. [f <M is a Banach algebra with identity, G, = {a © @: a is 
left invertible}, G. = {a © &: a is right invertible}, and G = {ae #: ais 
invertible}, then G,, G,, and G are open subsets of &. Also, the map 


a+» a‘ of G > Gis continuous. 


ProoF. Let a, € G and let b5 © M such that bya, = 1. If |la — aol < 
\|o|| ~*, then ||bja — 1|| = ||Bo(@ — ao)|| < 1. By the preceding lemma, x = 
ba is invertible. If b = x~ "bo, then ba = 1. Hence G, > {a © &: ||a — ag| 
< ||bo||~'} and G, must be open. Similarly, G, is open. Since G = G,/N G, 
(Why?), G is open. 

To prove that a ~ a~' is a continuous map of G > G, first assume that 
{a,,} is a sequence in G such that a, > 1. Let 0 < 6 <1 and suppose 
\|a, — 1|| <6. From the preceding lemma, a,’ = (1 —(1-a,)) ‘= 

© (1 —a,)* =1+ 2,01 — a,)*. Hence 


7 = k 
la,’ -1l= || 0% A -a,) | 


k=1 
< ) jl —a,ll* 
k=1 
< 6/(1 — 8). 


If e > 0 is given, then 6 can be chosen such that 6/(1 — 6) <e. So 
\|a,, — 1|| < & implies ||a7* — 1|| < e. Hence lima; * = 1. 
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Now let a € G and suppose {a,} is a sequence in G such that a, > a. 
Hence a‘a, > 1. By the preceding paragraph, a;'a =(a ‘a,)~' 
Hence a>!=a,'aa'>a'. & 


> 1. 


Two facts surfaced in the preceding proofs that are worth recording for 
the future. 


2.3. Corollary. Let . be a Banach algebra with identity. 


(a) If |la — 1|| < 1, then a! = Y%_,(1 — a)*. 
(b) If bay = 1 and |\a — ao|| < ||bol|~*, then a is left invertible. 


A maximal ideal is a proper ideal that is contained in no larger proper 
ideal. 


2.4. Corollary. If < is a Banach algebra with identity, then 


(a) the closure of a proper left, right, or bilateral ideal is a proper left, right, 
or bilateral ideal, 
(b) a maximal left, right, or bilateral ideal is closed. 


PROOF. (a) Let .@ be a proper left ideal and let G, be the set of 
left-invertible elements in .~%. It follows that ™@M G,= 1. (See the intro- 
duction to this section.) Thus @C.&W\G,. By the preceding theorem, 
A&\ G, is closed. Hence cl. # C Y\ G;,; and thus cl. W7 # &. It is easy to 
check that cl .@ is an ideal. The proof of the remainder of (a) is similar. 

(b) If @ is a maximal left ideal, cl_Z is a proper left ideal by (a). Hence 
M = cl“ by maximality. i 


If .f does not have an identity, then . may contain some proper, dense 
ideals. For example, let .= C,(R). Then C,(R), the continuous functions 
with compact support, is a dense ideal in C,(R). There is something that can 
be said, however (see Exercise 6). 


2.5. Proposition. Jf x is a Banach algebra with identity, then every proper 
left, right, or bilateral ideal is contained in a maximal ideal of the same type. 


The proof of the preceding proposition is an exercise in the application of 
Zorn’s Lemma and is left to the reader. Actually, this is a theorem from 
algebra and it is not necessary to assume that .% is a Banach algebra. 

Let sf be a Banach algebra and let -@ be a proper closed ideal. Note 
that .//M becomes an algebra. Indeed, (x + W\ y+ W)=xy+ Misa 
well-defined multiplication on %/4. (Why?) 


2.6. Theorem. Jf » is a Banach algebra and & is a proper closed ideal in 
SL, then /M is a Banach algebra. If X has an identity, so does %/M. 
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PROOF. We have already seen that »7/Z@ is a Banach space and, as was 
mentioned prior to the statement of the theorem, .//Z is an algebra. If 
x,y €PM and u,v © M, then(x + u)(y + v) = xy + (xv + uy + uw) E xy 
+ M. Hence |\(x + “)\y + M)|| = |lxy + A < |[(x + u(y + v)I| S IIx 
+ ul| |v + v||. Taking the infimum over all u,v in @ gives that ||(x + 
M)\y + M)\| < \|lx + MI \ly + 4. The remainder of the proof is left to 
the reader. is 


It may be that »//7 has an identity even if 2 does not. For example, let 
J = C,(R) and let Z= {¢ = C,(R): (x) = 0 when |x| < 1}. If ¢ € 
C,(R) such that $.(x) = 1 for |x| <1, then ¢,+-.@ 1s an identity for 
Ht/M. In fact, if 6 & C,(R), ($¢ — o)(x) = O if |x| < 1. Hence (¢ + 
M)\(o) + 4) = 6 + 4 (see Exercises 6 through 9). 


EXERCISES 


1. Let » be a Banach algebra and let # be all of the closed left ideals in ~. If 
1, € F, define J, Vv I, = cl(J, + I,) and I, A I, = J, 1 1,. Show that with 
these definitions & is a complete lattice with a largest and a smallest element. 


2. Let X be locally compact. For every open subset U of X, let 1(U)= {¢¢€ 
C)(X): ¢ = 0 on X\ U}. Show that U > J(U) 1s a lattice monomorphism of 
the collection of open subsets of X into the lattice of closed ideals of Cy(X). (it 
is, in fact, surjective, but the proof of that should wait.) 


3. Let (X, 2, ) be a o-finite measure space and let J be an ideal in L~(X, 92, p) 
that is weak* closed. Show that there is a set A in (@ such that J/= {¢eE 
L©(X,2,p): 6 =00n A}. 

0 


4. Let w={|" At a, B&F} and tet = {| 5 |: Be F}. Show that 


is a Banach algebra and -@ is a maximal ideal in ww. 
5. Show that for n > 1, M,(C) has no nontrivial ideals. How about M,,(IR)? 


6. Let & be a Banach algebra but do not assume that ww has an identity. If J is a 
left ideal of ., say that J is a modular left ideal if there is a u in #& such that 
A(1—u)={a-—au: a€W} CT; call such an element u of # a right 
modular unit for I. Similarly, define right modular ideals and left modular units. 
Prove the following. (a) If u is a right modular unit for the left ideal J and 
u € I, then J =.%. (b) Maximal modular left ideals are maximal left ideals. (c) 
If J is a proper modular left ideal, then 7 is contained in a maximal left ideal. 
(d) If J is a proper modular left ideal and u is a modular right unit for /, then 
|| — x|| > 1 for all x in J and clJ is a proper modular left ideal. (e) Every 
maximal modular left ideal of is closed. 


7. Using the terminology of Exercise 6, let J be an ideal of 7. Show: (a) if u isa 
right modular unit for J and v is a left modular unit for J, then u — uv € J. (b) 
If J is closed, ./J has an identity if and only if there is a mght modular unit 
and a left modular unit for 7. Call an ideal 7 such that »/J has an identity a 
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modular ideal. An element u such that u + I is an identity for »/I is called a 
modular identity for I. 


8. If is a Banach algebra, a net {e,} in & is called an approximate identity for 
Wf if sup,|le;|| < oo and for each a in W, e,a > a and ae, > a. Show that YW 
has an approximate identity if and only if there is a bounded subset E of W 
such that for every e > 0 and for every a in # there is an e in E with 
\|Jae — al| + |lea — al] < «. 


9. Show that if X is locally compact, then C)(X) has an approximate identity. 
10. If # is a Hilbert space, show that @)(3) has an approximate identity. 


11. If G is a locally compact group, show that L'(G) (1.11) has an approximate 
identity. [Hint: Let Y= all neighborhoods U of the identity e of G such that 
clU is compact. Order Y by reverse inclusion. For U in %, let fy, = m(U) ‘xy. 
Then { f,,;: U € ZY} is an approximate identity for L'(G).] 


12. For 0<r<1, let P: 0D — [0, 0) be defined by P(z) = U2 _yr'"'z” (the 


Poisson kernel). Show that { P.} is an approximate identity for L'( dD) (under 
convolution). 


13. If # is a Hilbert space and P is a finite-rank projection, show that @,(#) P is 
a closed modular left ideal of 4,(3). What is the associated right modular 
unit? 


14. Find the minimal closed proper left ideals of M,,(F). 


15. Find the minimal closed proper left ideals of ®)(3#/), # a Hilbert space. How 
about for B)(2%), # a Banach space? 


16. What are the maximal modular left ideals of @)(#), # a Hilbert space? 


§3. The Spectrum 


3.1. Definition. If ./ is a Banach algebra with identity and a € W, the 
spectrum of a, denoted by o(a), is defined by 


o(a) = {a € F: a — ais not invertible}. 


The /eft spectrum, o,(a), is the set {a € F: a — a is not left invertible}; the 
right spectrum, o,(a), is defined similarly. 

The resolvent set of a is defined by p(a) = F \ o(a). The /eft and right 
resolvents of a are p,(a) = F \o,(a) and p,(a) = F \oa(a). 


3.2. Example. Let X be compact. If fe C(X), then o( f) = f(X). In fact, 
if a = f(x), then f— a has a zero and cannot be invertible. So f(X) C 
o( f). On the other hand, if a € f(X), f — a is a nonvanishing continuous 
function on X. Hence (f — a)~' € C(X) and so f — a is invertible. Thus 


a € o(/). 
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3.3. Example. If 2 is a Banach space and A © 4(2), then 0(A) = {a € 
fF: either ker(A — a) = (QO) or ran(A — a) # 4}. In fact, this means that 
p(A) = F \ 0(A) = {a © F: A — a is byective}. If a © p(A), there is an 
operator T in @(2%) such that T(A — a) = (A — a)T = 1; clearly, A — a 
is bijective. On the other hand, if A — a is bijective,(A — a) | © @(X) by 
the Inverse Mapping Theorem. 


3.4. Example. If # is a Hilbert and A € @4(#), then o,(A) = {a € F: 
inf{||(A — a)All: ||h|] = 1} = 0}. In fact, suppose B © 4(#) such that 
B(A ~ a) =1. If |||] =1, then 1 = [Al] = |B(A — All < ||BI (A - 
a)h||. So ||(A — a)hl| = ||Bl|~! whenever ||A|| = 1. 

Conversely, suppose ||(4A — a)h|| => 6 > 0 whenever ||h|| = 1. Note that 
ker(A — a) = (0). It will now be shown that ran(A — a) 1s closed. In fact, 
assume that (A — a)f, > g. Then lif, — fall < (A — af, — fy dIl = CA 
—a)f,—(A — a)f,||. Thus {f,} is a Cauchy sequence. Let f, — f. Then 
g=lim(A — a)f, = (A — a)f; hence g € ran(A — a). Let #= ran(A — 
a); so (A-—a): #-> X is a bijection. Thus (A — a) ': #> # is 
bounded. Define B: #-— # by letting B(k +h) =(A—- a) 'k when 
keE KX andhex-. Thus BE &(#) and B(A — a) = 1. 

3.5. Example. If <= M,(R) and A = ° et | then o( A) = C1. In fact, 
A — a is not invertible if and only if 0 = det(A — a) = a? + 1, which is 
impossible in R. 


The phenomenon of the last example does not occur if 7 is a Banach 
algebra over C. 


3.6. Theorem. [f ./ is a Banach algebra over C with an identity, then for 
each a in &, o(a) is a nonempty compact subset of ©. Moreover, if 
la| > |lall|, a € o(a) and z2(z-—a)"! is an valued analytic function 


defined on p(a). 


Before beginning the proof, a few words on vector-valued analytic func- 
tions are in order. If G is a region in C and & is a Banach space, define the 
derivative of f: G—> at z, to be lim, ..2~'[f(z) + 4) — f(zo)] if the 
limit exists. Say that f is analytic if f has a continuous derivative on G. The 
whole theory of analytic functions transfers to this situation. The statements 
and proofs of such theorems as Cauchy’s Integral Formula, Liouville’s 
Theorem, etc., transfer verbatim. Also, f: G — % is analytic if for each zy 
in G there is a sequence X,,X,,X,,-... m % such that f(z) = LF_o(z - 
Z,)*x, whenever z © B(z,;r), where r = dist(z), 0G). Moreover, the con- 
vergence is uniform on compact subsets of B( Zz); r). 

There is also a way of obtaining the vector-valued case as a consequence 
of the scalar-valued case (see Exercise 4). 
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PROOF OF THEOREM 3.6. If |a| > |jajj, then a — a = a(1 — a/a) and |la/a| 
< 1. By Corollary 2.3, (1 — a/a) is invertible. Hence a — a is invertible 
and so a € o(a). Thus o(a) C {a © C: |a| < |jal|} and o(a) is bounded. 

Let G be the set of invertible elements of x. The map a> (a — a) isa 
continuous function of C —.%. Since G is open and p(a) is the inverse 
image of G under this map, p(a) is open. Thus o(a)=C \ p(a) Is 
compact. 

Define F: p(a) > @ by F(z) = (z — a)~!. In the identity x=! — yp’ = 
x (y—x)yl, let x =(a+h-a) and y=(a-—a), where a € p(a) 
and h € C such that h # 0 and a +h €& p(a). This gives 


F(a +h)— F(a) _(a+h-a) ‘(-h)(a— a)” 
h h 
=-(at+th-a) (a-a). 
Since (a + h — a)! > (a-—a)'ash—-0, Fa) exists and 
F’(a) = —(a—a) ”. 


Clearly F’: p(a) — # is continuous, so F is analytic on p(a). 
From the first paragraph of the proof and Corollary 2.3, if |z| > ||all, 


ney B22 E(2 


ras 


k 


Hence 


Fest ¥ [lal 


lz é2o\ zl 
one een ne 
lz| 1 — flail /12| 


= (\z| — |lall) 


Thus F(z) > 0 as z > oo. Therefore if p(a) = C, F is an entire function 
that vanishes at oo. By Liouville’s Theorem F is constant. Since F’ # 0, this 
is a contradiction. Thus p(a) # C, or o(a) + 0. a 


Because the spectrum of an element of a complex Banach algebra is not 
empty, the following assumption is made. 


Assumption. Henceforward, all Banach spaces and all Banach algebras are 
over C. 


3.7. Definition. If 27 is a Banach algebra with identity and a © #, the 
spectral radius of a,r(a), is defined by 


r(a) = sup{|a|: a € o(a)}. 
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Because o(a) # 0 and is bounded, r(a) is well defined and finite; because 


o(a) is compact, this supremum is attained. 


Let «/= M,(C) and let A = : ‘ Then A? = 0 and o( A) = {0}; so 


r(A) = 0. So it is possible to have r(A) = 0 with A # 0. 


3.8. Proposition. Jf . is a Banach algebra with identity and a€& &, 
lim||a”||'/" exists and 


r(a) = limjja”||'7". 


Proor. Let G= {z €C: z=0 or z' € p(a)}. Define f: G >W by 
f(0) = 0 and for z #0, f(z) = (z~' — a)“'. Since (a — a)” ' > 0 asa 
oo, f is analytic on G, and so f has a power series expansion. In fact, by 
Corollary 2.3, for |z| < |la|}~}, 


(Q= Vee a2) 2a" 
n=0 n=0 
From complex variable theory, this power series converges for |z| < RK = 
dist(0, OG) = dist(0,0(a)~!) (Here o(a)~' = {z~': z € o(a)}). Thus R = 
inf{ja|]: @~' € o(a)} =r(a)'. Also, from the theory of power series, 
R~} = limsup]|a”||'/". Thus 


r(a) = limsup|ja”||!”". 


Now if a€C and n>1, a” — a" =(a — ala"! + a" “a 
+++) ta" )=(a" !+a"*at--- +a" '\(a-—a). So if a” -— a” is 
invertible, a — a is invertible and (a — a)! = (a” — a") ‘(a"' 
+ -+- +a”~1), So for a in o(a), a” — a” is not invertible for every n > 1. 
By Theorem 3.6, Ja|” < ||a”||. Hence Ja| < |la”||'/” for all n > 1 and a in 
o(a). So if a € o(a), |a| < liminf||a”||'””. Taking the supremum over all a 
in o(a) gives that r(a) < liminfija”||'/" < limsup|ja”||'“" = r(a). So r(a) 
=lim|ja"||\/". 


3.9. Proposition. Let < be a Banach algebra with identity and let a © &. 


(a) If a € p(a), then dist(a, o(a)) = |\(a — a) *||~¢. 
(b) If a, B € p(a), then 

(a -a)'-(B—a) '=(B-a)(a~a) (B~a) © 

=(B-a)(B-a) (a-a)’. 

PROOF. (a) By Corollary 2.3, if a © p(a) and ||x — (a — a)|| < ||(a - 
a)~+\|~1, x is invertible. So if B € C and |B| < |\(a — a) "||", (B + a— a) 
is invertible; that is, a + B € p(a). Hence dist(a, o(a)) > |\(a — a) "|| 7°. 
(b) This follows by letting x =a-—a and y=f-—a in the identity 
x-ba~y tex "y—x)yl=y (y-x)x oe 
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The identity in part (b) of the preceding proposition is called the resolvent 
identity and the function a ~ (a — a)” ‘of p(a) > & is called the resolvent 
of a. 


EXERCISES 


1. Let S be the unilateral shift on /* (IJ.2.10). Show that S is left invertible but not 
right invertible. 


2. If is a Banach algebra with identity and a € Ww and is nilpotent (that is, 
a” = 0 for some n), then o(a) = {0}. 


3. Let (X,02,) be a o-finite measure space and let W= L°(X,9,") (1.6). If 
¢ €#, show that the following are equivalent: (a) a €o(¢); (b) 0= 
sup{inf{|¢(x) — al: x © X\ A}: A © Q and p(A) = 0}; (c)ife > 0, u({x € X: 
lp(x) — a] < €}) > 0; (d) if » is the measure defined on the Borel subsets of C 
by v(A) = p(o '(A)), then a € the support of ». 


4. If G is an open subset of C and f: G > & is a continuous function such that for 
each x* in &*, x*o f: G > Cis analytic, then f is analytic. 


5. If is a Banach algebra with identity, {a,} CW, a, > a, a, € o(a,), and 
a, —~ a, then a € o(a). 


6. If is a Banach algebra with identity and r: .- [0, co) is the spectral radius, 
show that r is upper semicontinuous. If a € # such that r(a) = 0, show that r 
is continuous at a. 


7. If x is a Banach algebra with identity, a,b € Y, and a is a nonzero scalar such 
that (a — ab) is invertible, show that (a — ba) is invertible and (a — ba)~! = 
a ' + a 'b(a — ab)~'a. Show that o(ab) U {0} = o(ba) U {0} and give an 
example such that o(ab) # o(ba). 


§4. The Riesz Functional Calculus 


Before coming to the main course of this section, it 1s necessary to have an 
appetizer from complex analysis. Many of these topics can be found in 
Conway [1978] with complete proofs. Only a few results are presented here. 

If y is a closed rectifiable curve in C and a € {y} = {y(t):0 <t <1}, 
then the winding number of y about a is defined to be the number 

1 1 
n(y;a)=z=— dz. 
(y; a) 


27iJ.zZ—-—a 


The number n(y; a) is always an integer and is constant on each component 
of C \ {y} and vanishes on the unbounded component of C \ {y}. 

Let G be an open subset of C and let 2 be a Banach space. If f: G — ¥ 
is analytic and x* © %*, then z+> (f(z), x*) is analytic on G and its 
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derivative is (f’(z),x*). By Exercise 4 of the preceding section, if /f: 
G — & is a continuous function such that z > (f(z), x*) is analytic for 
each x* in 2@*, then f: G—><Z is analytic. These facts will help in 
discussing and proving many of the results below. 

If y is a rectifiable curve in G and f/ is a continuous function defined in a 
neighborhood of {y} with values in 2, then /,f can be defined as for a 
scalar-valued f as the limit in 2 of sums of the form 


ye ly (z;) a y(t,_,)| (4), 

j 
where {fo,,,...,¢,} is a partition of [0,1]. Hence f,f = fof(y(t)) dy(t) © 
#. It is easy to see that for every x* in 2%, (ff, x*) = fy f(-), x*). 


4.1. Cauchy’s Theorem. Jf 2 is a Banach space, G is an open subset of C, f: 
G — £& is an analytic function, and y,,...,Y,, are closed rectifiable curves in 
G such that Li_,n(y;3 4) = 0 for alla in C\ G, then Li, J, f = 0. 


ProoF. If x* © 2*, then (Lf, f,x*) = Lif, fC), x*) = 0 by the 
scalar-valued version of Cauchy’s Theorem. Hence 17, i f=0. a 


4.2. Cauchy’s Integral Formula. Jf 2 is a Banach space, G is an open subset 
of C, f: G—>€ is analytic, y is a closed rectifiable curve in G such that 
n(y; a) = 0 for every ain C\ G, and X © G\ {y}, then for every integer 
k > 0, 


nly A)f(d) = 5a [2 A) PPL) 


4.3. Definition. A closed rectifiable curve y is positively oriented if for 
every a in G\ {y}, n(y; a) is either 0 or 1. In this case the inside of y, 
denoted by ins y, is defined by 


insy = {a€C\{y}: n(y; a) = 1}. 
The outside of y, denoted by out y, is defined by 
outy = {ae C\{y}: n(y;a) = 0}. 


Thus C = {y} Uinsy U outy. 

A curve y: [0,1] — C is simple if y(s) = y(t) implies that either s = ¢ or 
s = 0 and ¢t = 1. The Jordan Curve Theorem says that if y is a simple 
closed rectifiable curve, then C \ {y} has two components and {y} ts the 
boundary of each. Hence n(y; a) takes on only two values and one of these 
must be 0; the other must be +1. 

If I = {¥1,---; Ym} iS a collection of closed rectifiable curves, then I’ is 
positively oriented if: (a) {y,} N{y,} =O for i # J; (b) for a in C\ 
UT i{y}, a3 a) = LiL n(7;; a) 1s either O or 1. The inside of I, ins I, is 
defined by 


ins = {a: n(T; a) = 1}. 
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The outside of I, out I’, is defined by 
out!’ = {a: n(T; a) = 0}. 


4.4. Proposition. Jf G is an open subset of C and K is a compact subset of G, 
then there is a positively oriented system of curves T = {7 ,...,Y_,} inG\ K 
such that K € insI’ and C\ GC outI. The curves y,,...,Y,, can be found 
such that they are infinitely differentiable. 


The proof of this proposition can be found on p. 195 of Conway [1978], 
though some details are missing. 
If I’ = (¥,,---,Yn} and each y, is rectifiable, define 


brent 


J=1 Y 


whenever f is continuous in a neighborhood of {I}. 

Let be a Banach algebra with identity and let a © .w. One of the 
principal uses of Proposition 4.4 in this book will occur when K = o(a). If 
f: G— C is analytic and o(a) C G, we will define an element f(a) in 
by 


4.5 fla) = 55 f f(2)z- a) Ne 


where I is as in Proposition 4.4 with K = o(a). But first it must be shown 
that (4.5) does not depend on the choice of I’. That is, it must be shown that 
f(a) is well defined. 


4.6. Proposition. Let & be a Banach algebra with identity, let a & x, and 
let G be an open subset of € such that o(a) C G. If T = {y,...,Y,,} and 
A = {A,,...,A,} are two positively oriented collections of curves in G such 
that o(a) € insI and o(a) C insA and if f: G > C is analytic, then 


JIE ~a) ‘daz= Mole —a) ‘dz. 


Proor. For 1 <j <k, let y,,,,;=j*; that is, y,,,,(¢f)=A,(1 — ¢) for 
O<t<1.Ifz € G\o(a), theneitherz © C\ Gorz € o(a). Ifz EC \G, 
then Li‘n(y,;3 2) = n(T; z) —n(A;z)=0-0=0. If z€o(a), then 
vr n(y,; 2) = nT; z) — n(A;z)=1-1=0. Thus 2 = {y l<j<m 
+ k} is a system of closed curves in U = G\ o(a) such that n(2;z) = 0 
for all z in C\ U. Since z > f(z)(z — a)~' is analytic on U, Cauchy’s 
Theorem implies 


0 = ice ~a) ‘a= f {Ol ~a)'az- J fC ~a) "dz. 


a) 
As was pointed out before, Proposition 4.6 implies that (4.5) gives a 
well-defined element f(a) of s whenever f is analytic in a neighborhood 
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of o(a). Let Hol(a) = all of the functions that are analytic in a neighbor- 
hood of o(a). Note that Hol(a) is an algebra where if f, g © Hol(a) and f 
and g have domains D(f) and D(g), then fg and f+ g have domain 
D(f) O D(g). Hol(a) is not, however, a Banach algebra. 


4.7. The Riesz Functional Calculus. Let < be a Banach algebra with identity 
and leta © &. 


(a) The map f - f(a) a Hol(a) — # is an algebra homomorphism. 

(b) If f(z) = Le 202" nas radius of convergence > r(a), then f © Hol(a) 
and f(a) = X3_pa,a*. 

(c) df f(z) =-1, hen f(a) = 1. 

(d) If f(z) =z for all z, f(a) =a. 

(ce) If f,fi,f,-.. are all analytic on G, o(a)CG, and f,(z) > f(z) 
uniformly on compact subsets of G, then ||f,(a) — f(a)|| ~ 0 as n > oo. 


PROOF. (a) Let f, g © Hol(a) and let G be an open neighborhood of o(a) 
on which both f and g are analytic. Let I’ be a positively oriented system of 
closed curves in G such that o(a) C insI’. Let A be a positively oriented 
system of closed curves in G such that (insI’) U {f°} = clans!) € ins A. 
Then 


fla)a(a) = — 5] [flee = a) Mae] ften0e a) ae 


= 5 ff Hedee~ a) Ea) aba 


1 (z= a) -(S- a)” 
mereey a) sea} = |aa 


Z 


[by (3.9b)] 


-Lfre| : BE) |: —a) ‘de 


dala febele—or'a 


Aq? 


But for § on A, § € outI and hence /,[ f(z)/(§ — z)] dz = 0 (Cauchy’s 
Theorem). If ze {I}, then ze€insA and so /f,[g(§)/( — z)] df = 
27ig(z). Hence 


fla)g(a) = x f fl2)e(2Mz- a) ‘de 
= (fg)(a). 


The proof that (af + Bg)(a) = af(a) + Bg(a) is left to the reader. 
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(c) and (d). Let f(z) =2*, k > 0. Let y(t) = Rexp(27it), O<¢ <1, 
where R > |la||. So o(a) C ins and hence 


jis mi 22 Pe 


ee ee ee 
- Fai J? (1 =) ae 
: sei 20? De atyetae, 


Oni 
since ||a/z|| < 1 for |z| = R. Since this infinite series converges uniformly 
for z on y, 


f(a) = Ela! 


If n # k, then z-“~**” has a primitive and hence f,z~‘"~ ero de = = 0. For 
n = k this integral becomes {,z~'dz = 2ai. Hence f(a) = a‘ 

(e) Let = {y,,---,Y%} be a positively oriented system of closed curves 
in G such that o(a) C insI’. Fix 1 < k < m; then 


| [tee = a) Ne = f fe )(2 = 0) ee 


oad. zn k+1 


rr a" 


=| ['LCn(9) ~PaOE(9 = al Mar | 
S MACRO) — fly(t))I Ly, (7) 7 a| ~l d\y,|(t). 


Now t+ ||[y,(t) — a]~"|| is continuous on [0,1] and hence bounded by 
some constant, say M. Thus 


(oc ~ a) "de = f f(2)e =a) 


< M\ly,||max{f,(z) — f(z): z © {x3}, 


where ||y,|| 1s the total variation (length) of y,. By hypothesis it follows that 
IIf,(@) — f(a)|| > . as n> 0. 

(b) If p(z) = L2_a,z * is a polynomial, then (a), (c), and (d) combine to 
give that p(a)= Fn 9a, a". Now let f(z) = L%_,a,z* have radius of 
convergence R > r(a), the spectral radius of a. If p,(z) = L%_,a,z*, 
P, (2) > f(z) uniformly on compact subsets of {z: |z| < R}. By (e), p,(a) 
— f(a). So (b) follows. x 


The Riesz Functional Calculus is used in the study of Banach algebras 
and 1s especially useful in the study of linear operators on a Banach space 
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(Sections 6 and 7). Now our attention must focus on the basic properties of 
this functional calculus. The first such property is its uniqueness. 


4.8. Proposition. Let © be a Banach algebra with identity and let a © &. 
Let t: Hol(a) — & be a homomorphism such that (a) t(1) = 1, (b) t(z) = a, 
(c) if {f,,} is a sequence of analytic functions on an open set G such that 
o(a) C Gand f,(z) > f(z) uniformly on compact subsets of G, then t(f,,) > 


T(f). Then +(f) = f(a) for every f in Hol(a). 


PROOF. The proof uses Runge’s Theorem (III.8.1), but first it must be shown 
that r( f) = f(a) whenever f is a rational function. If nm > 1, 7(z") = 7(z)” 
=a"; hence t( p) = p(a) for any polynomial p. Let g be a polynomial 
such that g never vanishes on o(a), so 1/q © Hol(a). Also, 1 = 7(1) = 
(qq ') = 1(q)t(q_') = q(a)t(q_'). Hence g(a) is invertible and 
qg(a)' = 7(q~'). But using the Riesz Functional Calculus, a similar argu- 
ment shows that g(a)~' = (1/q)(a). Thus 7(qg~') = (1/q)(a). Therefore if 
f = p/q, where p and gq are polynomials and q never vanishes on o(a), 
(f)=7(p-q-')=7(p)1(q*) = p(ay(1/q(a) = f(a). 

Now let f € Hol(a) and suppose f is analytic on an open set G such that 
o(a) C G. By Runge’s Theorem there are rational functions { f, } in Hol(a) 
such that f,(z) — f(z) uniformly on compact subsets of G. By (111) of the 
hypothesis, 7(f,) > 7(f). But 7(f,) =f,(a@) and f(a) > f(a) by (4.7e). 
Hence 7(f) = f(a). a 


A fact that has been implicit in the manipulations involving the func- 
tional calculus is that f(a) and g(a) commute for all f and g in Hol(a). In 
fact, if r: Hol(a) — & is defined by r(a) = f(a), then f(a)g(a) = t( fg) = 
7(gf) = g(a)f(a). Still more can be said. 


4.9. Proposition. Jf a,b <€.%, ab = ba, and f © Hol(a), then f(a)b = 
bf(a). 


ProoF. An algebraic exercise demonstrates that f(a)b = bf(a) if f is a 
rational function with poles off o(a). The general result now follows by 
Runge’s Theorem. = 


4.10. The Spectral Mapping Theorem. Jf a € and f © Hol(a), then 
o( f(a)) = f(o(a)). 


PRooF. If a € o(a), let g © Hol(a) such that f(z) — f(a) = (z — a)g(z). 
If it were the case that f(a) € o( f(a)), then (a — a) would be invertible 
with inverse g(a)[ f(a) — f(a)]~*. Hence f(a) € o( f(a)); that is, f(o(a)) 
Cc o( f(a)). 

Conversely, if 8 € f(o(a)), then g(z) =[f(z) — B]~* € Hol(a) and so 
g(a)[ f(a) — B\) = 1. Thus B € o( f(a)); that is, o( f(a)) C f(o(a)). of 
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This section closes with an application of the functional calculus that is 
typical. 


4.11. Proposition. Suppose a € WY and o(a) = F, U F,, where F, and F, 
are disjoint nonempty closed sets. Then there is a nontrivial idempotent e in 
such that 


(a) if ba = ab, then be = eb; 
(b) if a, = ae anda, =a(1 — e), thena =a, + a, and a,a, = a,a, = 0; 
(c) o(a,) = F, U {0}, o(a,) = F, U {0}. 


PROOF. Let G,,G, be disjoint open subsets of C such that EF. C.Giy 7 = 152. 
Let I’ be a positively oriented system of closed curves in G, such that 
F, CinsI, F, € out I. If f = the characteristic function of G,, f © Hol(a); 
let e = f(a). Since f? = f, e* = e. Part (a) follows from (4.9). 

Note that e(1 — e) = 0 = (1 — e)e. Hence (b) is immediate. Let f,(z) = 
Zf(z), f(z) = 2. — f(z)). It follows from (4.7a) that a; = f(a), j = 1,2. 
Hence the Spectral Mapping Theorem implies that o(a,) = f,(o(a)) = F,U 
{0}. fe) 


Part (c) of the preceding proposition has the somewhat unattractive 
conclusion that o(a,) = F, U {0}. It would be much neater if the conclu- 
sion were that o(a,) = F;. This is, in a sense, the case. Since a,(1 — e) = 0 
and 1—e#0, a, cannot be invertible. However, consider the algebra 
A, = Ve. \t is left to the reader to show that ., is a Banach algebra and e 
is the identity for W,. If a, is considered as an element of the algebra v7, 
then its spectrum as an element of ., is F;. This is an illustration of how 
the spectrum depends on the Banach algebra (the subject of the next 
section; also see Exercise 9). 


EXERCISES 


1. Let w= C(X), X compact (see Example 3.2). If g € C(X) and f € Hol(g), 
show that f(g) = fog. 


2. Let a be a nilpotent element of . For f,g in Hol(a), give a necessary and 
sufficient condition on f and g that f(a) = g(a). 


3. Let d>1 and let A € M,(C). Give a necessary and sufficient condition on f in 
Hol( A) such that f(a) = 0. (Hint: Consider the Jordan canonical form for A.) 


4. If x is a Banach algebra with identity, a € Ww, f € Hol(a), and g is analytic in 
a neighborhood of f(o(a)), then go f € Hol(a) and g(f(a)) = ge f(a). 


5. If is a Banach space, A € @(#), and WM < & such that (A —a) ‘MC H 
for all a in p(A), show that f(A)4 C4 whenever f € Hol( A). 


6. If % is a Banach space, A € @(#), and f © Hol(A), show that f(A)* = f(A*). 
(See (6.1) below.) 
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7: If KH isa Hilbert space, A © #(#), and f © Hol(A), show that f(A)* = 
f(A*), where f(z) = f(Z). (See (6.1) below.) 


8. If # is a Hilbert space, A is a normal operator on #, and f © Hol(A), show 
that f(A) is normal. 


9. Let 2 be a Banach space and let A € 4(2#). Show that if o(A) = F, UF, 
where F,, F, are disjoint closed subsets of C, then there are topologically 
complementary subspaces %2,,%, of % such that (a) B2,;C 2% (j = 1,2) 
whenever BA = AB; (b) if A; = A|%,, 0(A,;) = F; (c) there is an invertible 
operator R: #— #, ©,%, such that RAR”! = A, © A). 


10. Let A © M,(C), o(A) = {q,..., a, }, where a; # a; for 1 # j. Show that for 
1 <j <n there is a matrix A, in M, (©) such that 0(A,;) = {a,} and A is 
similar to A; ® --- @A,,. 


§5. Dependence of the Spectrum on the Algebra 


If DD = {z €C: |z| = 1}, let @ = the uniform closure of the polynomials 
in C(dD). (Here “polynomial” means a polynomial in z.) If “= C(dD), 
then the spectrum of z as an element of & is 0D (Example 3.2). That is, 


64(z) = 0D. 


Now z € & and so it has a spectrum as an element of this algebra; 
denote this spectrum by o,(z). There is no reason to believe that og(z) = 
0,,(z). In fact, they are not equal. 


5.1. Example. If @ = the closure in C( dD) of the polynomials in z, then 
o,(z)=clD. 

To see this first note that ||z|| = 1, so that og(z) C clD by Theorem 3.6. 
If |A| < 1 and A € o,(z), there is an f in @ such that (z — A)f = 1. Note 
that this implies that |A| <1. Because f€ @, there is a sequence of 
polynomials { p,} such that p, — f uniformly on oD. Thus for every e > 0 
there is a N such that for m,n >WN, €> |lp, — parllap = sup{|P,(2) — 
P,(z)|: z = 0D}. By the Maximum Principle, ¢ > ||p,, — p,,||ap for m,n = 
N. Thus g(z) = lim p,(z) is analytic on D and continuous on cl D; also, 
g|oD = f. By the same argument, since p,(z)(z — A) > 1 uniformly on 
oD, p,(z\(z — A) > 1 uniformly on D. Thus g(z)(z — A) = 1 on D. But 
1 = g(A)(A — A) = 0, a contradiction. Thus, clD C og(z). 


Thus the spectrum not only depends on the element of the algebra, but 
also the algebra. Precisely how this dependence occurs is given below, but it 


VII.5. Dependence of the Spectrum on the Algebra 211 


can be said that the example above is typical, both in its statement and its 
proof, of the general situation. To phrase these results it is necessary to 
introduce the polynomially convex hull of a compact subset of C. 


5.2. Definition. If A is a set and f: A — C, define 


Illa = sup{|f(z)|: z © A}. 


If K is a compact subset of C, define the polynomially convex hull of K to 
be the set K given by 


K = {z €C: |p(z)| < |lp|lx for every polynomial p}. 
The set K is polynomially convex if K = K . 


Note that the polynomially convex hull of dD is clD. This is, again, quite 
typical. If K is any compact set, then C \ K has a countable number of 
components, only one of which is unbounded. The bounded components are 
sometimes called the holes of K; a few pictures should convince the reader 
of the appropriateness of this terminology. 


5.3. Proposition. Jf K is a compact subset of C, then C \ K is the unbounded 
component of © \ K. Hence K is polynomially convex if and only if C \ K is 
connected. 


PROOF. Let U,U,,... be the components of C\ K, where Up is un- 
bounded. Put L =C€\ U,; hence L = K UU*%_,U,. Clearly K CK. If 
n > 1, then U, is a bounded open set and a topological argument implies 
dU, < K. By the Maximum Principle U, C K. Thus, L Cc K- 

If a € U,, (z — a)~* is analytic in a neighborhood of L. By (IIL8.5), 
there is a sequence of polynomials { p,} such that ||p, — (z — a)~'||, > 0. 
If g, = (z — a)p,, then ||¢, — 1||, — 0. Thus for large n, ||¢, — 1||, < 1/2. 
Since K C L and |q,(a) — 1| = 1, this implies that a € K. Thus K C L. 

mi 


5.4. Theorem. If < and & are Banach algebras with identity such that 
BCcAManda é€ B&, then 


(a) o,,(a) € og(a) and dog(a) € d0,(a). 

(b) 0,,(a) = og(a) . 

(c) If Gis a hole of o,(a), then either G C og(a) or GN og(a) = 0. ; 
(d) If B is the closure in & of all polynomials in a, then og(a)=90,(a) . 


PrRooFr. (a) If a € og(a), then there is a b in @ such that b(a — a) = 
(a—a)b=1. Since BCH, a Ea,(a). Now assume that A © dog(a). 
Since into,(a) € intog(a), it suffices to show that A € o(a). Suppose 
A € o,,(a); there is thus an x in & such that x(a — a) =(a-—a)x = 1. 
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Since A € dog(a), there is a sequence {A ,,} in C \ og(a) such that A, > X. 
Let (a — d,,)~' be the inverse of (a — A,,) in @; so(a — A,,)* EM. Since 
A,, 2A, (a —A,) > (a — X). By Theorem 2.2, (a — A,,)~' > x. Thus x € 
B since & is complete. This contradicts the fact that A © og(a). 

(b) This is a consequence of (a) and the Maximum Principle. 

(c) Let G be a hole of o,(a) and put G, = GN og(a) and G, =G\ 
0g(a). So G = G, U G, and G, MN G, = O. Clearly G, is open. On the other 
hand, the fact that dog(a) € 52a) and GM o,(a) = UO implies that G, = 
GM intog(a), so G, 1s open. Because G is connected, either G, or G, is 
empty. 

(d) Let # be as in (d). From (a) and (b) it is known that o,(a) € og(a) 
Co,(a) . Fix in o,(a) . If A €ag(a), (a-A) |e BCH. Hence 
there is a sequence of polynomials { p, } such that p,(a) > (a — A)~'. Let 
g,(Z) = (z — A)p,(z). Thus ||¢,(a) — 1|| ~ 0. By the Spectral Mapping 
Theorem, o,,(q,(@)) = q,(6,(a)). Thus, because \ € o(a) , 


lig,(a@) — 1] = r(q,(@) — 1) 
= sup{|z — 1]: z € oy(9,(a))} 
= sup{|q,(w) — 1|: we o,(a)} 


lan(A) - 1 
= 1. 


IV 


This is a contradiction. wi 


EXERCISES 


1. If K is a compact subset of C, let P(K) be the closure of the polynomials in 
C(K). Show that the identity map on polynomials extends to an isometric 
isomorphism of P(K) onto P(K ). 


2. If K is a compact subset of C, let R(K) be the closure in C(K) of all rational 
functions with poles off K. If fe R(K), show that og(x,(f) = f(K). Tf fe 
P(K), show that op;x)(f) = f(K ). 


3. Let ,@ be as in Theorem 5.4. If a € @ and og(a) C R, show that og(a) = 
0.(a). 
4. Let & be a Banach algebra with identity and let a © Ww. If G,,G,,... are the 


holes of o,,(a) and 1 <n, < n,..., show that there is a subalgebra 4 of W 
such that a € @ and og(a) = o,,(a) UUZ_IG, . 


5. If &, 4, and a are as in Theorem 5.4, .» is not abelian, and @ is a maximal 
abelian subalgebra of &, show that o,(a) = 0g(a). 


6. If K is a nonempty compact subset of C that is polynomially convex, show that 
the components of int K are simply connected. 


VII.6. The Spectrum of a Linear Operator 213 


§6. The Spectrum of a Linear Operator 
The proof of the first result is left as an exercise. 


6.1. Proposition. 


(a) If & is a Banach space and A © &(2), o(A*) = o( A). 
(b) If # is a Hilbert space and A © B(#), o( A*) = o( A)*, where for any 
subset A of C, A* = {z: zE A}. 


In this section only results about operators on Banach spaces will be 
given. For the corresponding results about operators on a Hilbert space 
involving the adjoint, the reader is asked to supply the details. The preced- 
ing proposition should be kept in mind as a model of the probable 
differences. 

In this section and the next 2 always denotes a Banach space over C. 


6.2. Definition. If A © 4(2), the point spectrum of A,o,(A), is defined 
by 


o,(A) = {AX © C: ker(A — A) # (0)}. 


As in the case of operators on a Hilbert space, elements of o,( A) are called 
eigenvalues. If \ € 0,(A), vectors in ker(A — A) are called eigenvectors; 
ker(A — A) is called the eigenspace of A at X. 


6.3. Definition. If A € &(2), the approximate point spectrum of A, 6,,(A), 
is defined by 


o,,(A) = {A © C: there is a sequence { x, } in 
such that ||x,|| = 1 for all n and ||(A — A)x,|| > 0}. 
Note that 0,(A) ¢ o,,(A). 


6.4. Proposition. [f A € @(%) and X€&C, the following statements are 
equivalent. 


(a) A € 9,,(A). 
(b) ker(A — A) = (0) and ran(A — Xd) is closed. 
(c) There is a constant c > 0 such that ||(A — A)x|| = c||x|| for all x. 


PROOF. Clearly it may be assumed that A = 0. 

(a) = (c): Suppose (c) fails to hold; then for every n there is a vector x, 
with ||Ax,l| < [lxgl|/n. If y, =x,/llXyll. llysl| = 1 and ||Ay,|| > 0. Hence 
0 € o,,(A). 

(c) = (b): Suppose ||Ax|| > c||x||. Clearly ker A = (0). If Ax, > y, ||x, - 
Xmll < ¢ '||Ax, — Ax,,||, So {x,} is a Cauchy sequence. Let x = lim x,; 
therefore Ax = y and ran A is closed. 
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(b) = (a): Let Y= ran A; so A: > Y is a continuous biection. By the 
Inverse Mapping Theorem, there is a bounded operator B: Y—> % such 
that BAx = x for all x in 2. Thus if ||x|] = 1, 1 = ||BAx|| < || Bl] || Ax]. 
That is, ||Ax|| > ||B\|~' whenever ||x|| = 1. Hence 0 € 0,,(A). iz 


It may be that o,(A) is empty, but it will be shown that o,,(A) is never 
empty. The first statement follows from the next result (or from other 
examples that have been presented); the second statement will be proved 
later. 


6.5. Proposition. Jf 1 < p < 00, define S: 1? > 1” by S(x,,x3,...) = 
(0, x,,X2,...). Then o(S) = clD, o6,(S) =O, and o,,(S) = dD. Moreover, 
for |A| < 1, ran(S — A) is closed and dim[l?/ran(S — d)] = 1. 


ProoF. Let S, be the shift on /?. For 1 < p < 00, define T,: /? — 1? by 
T,(X1,X2,---) = (X2, X3,--.). It is easy to check that for 1 < p < oo and 
1/p + 1/q = 1, S* = T,,. Since ||S,|| = 1, o(S,) € cl D. 

Suppose x = (X,,X),...)E/?, A#0. If S,<-> Ax, O=AxX,, xX, = 
AX,,... . Hence 0 =x, =x, = --- . Since S, is an isometry, kerS, = (0). 
Thus o,(S,) = 0. 

Let 1<p<oo and |A| <1. Put x, =(1,A,A’,...). Then |[x,|% = 
Lee o|A?|” < 00. Also, T,x, = (A,X’,...) =AxX,. Hence A € a,(T,) and 
x, © ker(T, — A). If 1 <p < 00 and l/p + 1/q=1, T,=S%; soDc 
0(T,) = o(S,). Also, S,, = T/*, so D € o(S,,). Thus for all p, D € o(S,) 
C clD. Since o(S,) is necessarily closed, o(S,) = cl D. 

If |A| <1 and x e€/?, |\(S, — A)x|l, = IIS,x — Axl], = IIS), — 
IMilellp| = LLxllp — IMIlxllp| = @ — IADILxI|,. By 6.4), 4 € 0,,(S,). Hence 
o,,(S,) © OD. The fact that o,,(S,) = dD follows from the next proposi- 
tion (6.7). 

Fix |A| < 1, so ran(S, — A) is closed. If 1 < p < oo, then dim[/?/ran(S, 
— d)] = dimfran(S, — A)*] (Why?) = dim ker(S* — A) (VI1.8) = 
dim ker(7, — A). Also, dim[/®/ran(S,, — A)] = dimf[ ~ran(S,, — A)] (Why?) 
= dimker(7, — A). So to complete the proof it suffices to show that 
dimker(T, —A)=1 for 1<ps<oo. If xe/? and T,x =Ax, then 
(X5,X%3,---) = (AX, AX,,...). So x,4, =Ax, for all n. Thus x, = Ax); 
X; =Ax, = A*x,,.... Hence x,,, = A"x,. That is, if x, = (1,A,A’,...), 
then x = x,x,. Since it has already been shown that x, € ker(T, — A), this 
completes the proof. i] 


6.6. Corollary. Jf 1 < p < 0 andT: |? > I” is defined by T(x,, X5,...) = 
(x5,%X3,...), then o(T)=clD and for |A| <1, ker(T — A) is the one- 
dimensional space spanned by the vector (1, X, ’,...). 


The next result shows that if S is as in (6.5), then 0D C a,,,(S). 
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6.7. Proposition. [f A © @(2), then do(A) C o,,(A). 
PRrooF. Let A € do(A) and let {A,,} C C \ o(A) such that A, > X. 


6.8. Claim. ||(4 — ,)~ "|| ~ 00 as n > 00. 


In fact, if the claim were false, then by passing to a subsequence if 
necessary, it follows that there is a constant M such that ||(A —A,)71|| < M 
for all n. Choose n sufficiently large that |A, — A| < M~*. Then ||(A — A) 
—(A-A,)|| < (A —A,)~"]~*. By (2.3b), this implies that (A — A) is 
invertible, a contradiction. This establishes (6.8). 

Let ||x,|| = 1 such that a, = |\(A —A,)~‘x,|| > (A —A,)-'] — 07 
so a, — oo. Put y, =a, ‘(A —A,)7‘x,; hence |ly,|| = 1. Now 

(APN) yg (AS hy) ONE A, 
Oe Xp Ng) Ve 
Thus ||(A — A)y,|| < a, + |A —A,|, so that ||(A — A)y,|| ~ Oasn > oo. 
That is, A€o,,(A). 


Let A © &(2) and suppose A is a clopen subset of o( A); that is, A is a 
subset of o( A) that is both closed and relatively open. So o(A)=AU 
(o(A)\ A). As in Proposition 4.11 (and Exercise 4.9), 


6.9 E(A) = E(4: A) = 555 f(z - A) Nae, 


where I’ is a positively oriented Jordan system such that A C insI and 
o0(A)\A C out I, is an idempotent. Moreover, E(A)B = BE(A) whenever 
AB = BA and if 2, = E(A)&, o( A|%,) = A. If A =a singleton set {A}, 
let E(A) = E({A}) and 2 = %,y,- Note that if A is an isolated point of 
o( A), then {A} is a clopen subset of o( A). 


6.10. Example. Let {a,} @/~, 1 <p < oo, and define A: /? >/” by 
(Ax)(n) = a,x(n). Then o(A) = cl{a,,} and o,(A) = {a,}. Let e,(k) = 0 
if kK #n and 1 if k =n. For each k, define N, = {n EN; a, = a,} and 
define P,: 1? > I? by P,x = xy_,x. If a, is an isolated point of o( A), then 
{a,} 1S a clopen subset of o( A) and E({a,}; A) = P,. 


Suppose A © #(#) and X, is an isolated point in o( A). Hence E(A,) = 
E(A,Q; A) is a well-defined idempotent. Also, A, is an isolated singularity of 
the analytic function z  (z — A)~' on C \ o(A). Perhaps the nature of 
this singularity (pole or essential singularity) will reveal something of the 
nature of A, as an element of o(A). First it is helpful to get the precise 
form of the Laurent expansion of (z — A)~! about A,. 


6.11. Lemma. Jf i, is an isolated point of o( A), then 


(2-4) = Y (2-A0)"4, 


n 
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for 0 < |z — Xl < % = dist(Ag, o(A) \ {AQ }), where 
_ 1 —n—-] —] 
An = Fai J (2 By asaya 
for y = any circle centered at \, with radius < ry. 


The proof follows the lines of the usual Laurent series development 
(Conway [1978)]). 


6.12. Proposition. Jf , is an isolated point of o(A), then X, is a pole of 
(z — A): of order n if and only if (Ay — A)"E(A,) =9 and (Ay - 
A)" 'E(A,) # 0. 

Proor. Let (z — A)°>' = ©&__.(z — A,)"A, as in (6.11). Now X4 is a pole 
of order n if and only if A_, #0 and A_,=0 for k >n. Let I’ bea 
positively oriented system of curves such that o(A)\ {A,)} C insI’ and 
A, € out I’. Let y be a circle centered at A, and contained in out I’. Let 


e(z) = 1 ina neighborhood of y U insy and e(z) = 0 in a neighborhood of 
I UinsI. Soe e€ = and e(A) = E(A,). If k => 1, 


ai [2 ~ do) (e~ AY ae 


1 


vit pene 


2mi J, 4 


= E(Xg)(A ~ Ao)” 
since o(A) C ins(y + I) =insy UinsI. The proposition now follows. 
| 


6.13. Corollary. If A, is an isolated point of o(A) and is a pole of 
(z — A)7', thend, € 0,( A). 


In fact, the preceding result implies that if n is the order of the pole, then 
(0) # (A, — A)” VE(A,)&C ker(A — Xo). 


6.14. Example. A measurable function k: [0,1] <x [0,1] > C is called a 
Volterra kernel if k is bounded and k(x, y) = O when x < y.If1<p<o 
and k is a Volterra kernel, define V,: L?(0,1) ~ L?(0, 1) by 


Vf (x) = [k(x 9100) = [ke ya) a. 


Then V, € @(L?”) and ||V,|| < |Al|,, (V1-2.3). 
If k,h are Volterra kernels and 


(hk) x.y) = fUh(x. k(t, y) de 
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then hk is a Volterra kernel, ||hk]||,, < |All ,,||kl|,,, and V,, = V,V,. Note 
that if k(x, y) is the characteristic function of {(x, y) © [0,1] x [0,1]: 
y < x}, then V, is the Volterra operator (II.1.7). 

If &k is a Volterra kernel, then 


o(V,) = {0}. 
Indeed, from the preceding paragraph it is known that V,” = V,,. This will 
be used to show that the spectral radius of V, is 0. 


MAN. 
(n — 1)! 


This is proved by induction. Clearly it holds for n = 1. Suppose (6.15) is 
true for some n = 1. Then 


K(x, y) -| [eek y) 


6.15. Claim. |k"(x, y)]| < ———(x — y)""! for y < x. 


< fie onike(e, 9) dt 


kil" px _ 
< Keg ad (t—y)" dt 
anes 


— WA . 
ay (xy). 
This establishes the claim. 
From (6.15) it follows that 
n ; WAM 
We'll < WA" 


° S (n= 1) 
Therefore 
We" < [ello E(w = 1s". 
Since [(n — 1)!]'”" > Oasn > oo, r(V,) = 0. ThusO ¥ o(V,) C {A EC: 
|A| < 0}; that is, o(V,) = {0}. 
It is possible for kerV, to be nontrivial. For example, if k(x, y) = 
X0,1/2(¥) when y < x and 0 otherwise, then 


je if x <4, 
Vif (x) = 


So if f(y) =0 for0<y<35,V,f=0. 

On the other hand, the Volterra operator V [= V, for k(x, y) = the 
characteristic function of {(x, y): y<x}] has kerV = (0). In fact, if 
0 = Vf, then for all x, 0 = {of(y) dy. Differentiating gives that f = 0. 
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Is there an analogy between V, for a Volterra kernel & and a lower 
triangular matrix? 


EXERCISES 
1. Prove Proposition 6.1. 


2. Show that for # a Banach space and A in &(2%), o,(A) = 6,(A*). What 
happens in a Hilbert space? 


3. If # is a Hilbert space and K is a compact subset of C, show that there is an A 
in @(#) such that o(A) = K. Can A be found such that 0(A) = o,,(A) = K? 


4. Let K be a compact subset of C. Does there exist an operator A in 4(C[0, 1]) 
such that o(A) = K? 


5. If & is a Banach space and A € @(2), show that A is left invertible if and 
only if ker A = (0) and ran A is a closed complemented subspace of 2. 


6. If & is a Banach space and A € 4(2), show that A is right invertible if and 
only if ran A = & and ker A is a complemented subspace of 2. 


7. If & is a Banach space and T: #— & is an isometry, then either o(7) C OD or 
o(7T)=clD. 


8. Verify the statements made in Example 6.10. 


9. Let 1 < p < o0 and suppose 0 < a, < a, --- such that r = lima, < oo. De- 
fine A: /? > 1? by A(x), X>,...) = (0, 0,1, &)X>,...). Show that o(A) = {z 
EC: |z| <r} and o,,(A) = 00(A). If |A] < r, then ran(A — A) is closed and 
has codimension 1. Also, 6,(A) =. 


10. Verify the statements made in Example 6.14. 


11. Let 1 < p < © and let (X, 2,1) be a o-finite measure space. For @ in L*(p), 
define M, on L?(p) as in Example III.2.2. Find o(M;), 9,,(M,), and o,(M,). 


12. If Ae B(#), f © Hol(A), and X €o0,(A), does f(A) € 6,(f(A))? If AE 
0,,(A), does f(A) € o,,(f(A))? Is there a relation between f(o,,(A)) and 
0,)(f(A))? 

13. If A € @(&) say that a complex number A has finite index if there is a positive 
integer k such that ker(A — A)* = ker(A — A)**!; the index of X, denoted by 
v(A) or v,(A), is the smallest such integer k. (a) Show that if A is an isolated 
point of o(A) and a pole of order n of (z — A)~', then v(A) =n. (b) If 
v(A) < 00, show that ker(A — A)’ = ker(A — A)’ ** for all k > 0. (c) If 

0 

1 O 

#=C" and A = 1 , then 0(A) = {0} and v(0) = 27. 

1 O 

14. If V is the Volterra operator, show that 0 is an essential singularity of 
(z-V)7'. 


15. Let & be a Banach algebra with identity. If a € W, define L,,R, © B( PM) by 
L,(x) = ax and R,(x) = xa. Show that o(L,) = o(R,) = 9(a). 
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§7. The Spectral Theory of a Compact Operator 


Recall that for a Banach space %, @,(#) is the algebra of all compact 
operators. This Banach algebra has no identity, so if A € #,(%), then 
o(A) refers to the spectrum of A as an element of 4(2%). Of course, if 
A= B(X)+C, then x is a Banach algebra with identity (Why?) and we 
could consider o (A) for A in @)(2%). By Theorem 5.4, o(A) € o,(A), 
00,(A) € o(A), and 0(A) = o(A) . Below, in Theorem 7.1, it will be 
shown that o(A) is a countable set and hence o(A) = 00(A) = o( A) . 
Thus o( A) = o,(A). 


7.1. Theorem. (F. Riesz) Jf dim 2= co and A © &)(#), then one and only 

one of the following possibilities occurs. 

(a) o( A) = {0}. 

(b) o( A) = {0,A,,...,A,}, where for 1 <k <n, 4, #0, each X, is an 
eigenvalue of A, and dimker(A — X,) < 00. 

(c) o( A) = {0,A,,A5,...}, where for each k = 1, X, is an eigenvalue of A, 
dim ker(A — A,) < 0, and, moreover, lim), = 0. 


The proof will use several lemmas. The first lemma was given in the case 
that % is a Hilbert space in Proposition II.4.14. The proof is identical and 
will not be repeated here. 


7.2. Lemma. If A © @)(2%), X\ #0, and ker(A — A) = (0), then ran(A — 
dX) is closed. 


The proof of the next lemma is like that of Corollary IT.4.15. 


7.3. Lemma. If A © @)(%), A #0, and d € o( A), then either  € 0,(A) 
or X € 0,(A*). 


7.4. Lemma. If 7< V< %, M#WN, and «> 0, then there is a y in W 
such that ||y|| = 1 and dist(y, #7) >=1— «. 


PROOF. Let 5( y) = dist(y, 4) for every y in WY. Now if y, © YW, there is 
an x, in 4 such that 8(y,) < ||x> — yi|| < + €)d(y,). Let y, = y, — Xo. 
Then (1 + €)d(y,) = (1 + e)inf{||y, — x): x EW} = (1 + e)inf{|ly, — x, 
— x||: x € #}=(1 + €)8()y,) since x) € 4. Thus (1 + €)6( yy) > ||xq - 
Yill = |ball- Let y = |lyall~"y2- So |lyll = 1, »y © Y, and if x €.Z, then 

ly — xl] = IIball “v2 — =| 


= [Lyall 2 — Walla > [G+ €)8(¥2)] 2 — ial 
>(l+e)'>1l-e 


If @ and / are finite dimensional in the preceding lemma, then y can 
be chosen in ™ such that ||y|| = 1 and dist( y, @) = 1 (see Exercise 1). 
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7.5. Lemma. /f A © &)(2) and {X,,} is a sequence of distinct elements in 
0,(A), then limA,, = 0. 


PROOF. For each n let x, © ker(A — A,,) such that x, # 0. It follows that if 
M ,, =N{x),,..-,X,}, then dim .Z, = n (Exercise). Hence 4%, <.@,,, and 
M , #M,,,,. By the preceding lemma there is a vector y, in 4, such that 
lly, {| = 1 and dist(y,, 4, ,) > 4. Let y, = a,x, +--+ +a,x,. Hence 


(A oT VN) Yn = a, (A, a Nn )%4 ae ew TOG Ns Sy a 1 ee EM, _\. 
Soifn>m, 
A(Aj1y,)— A( An Yn) = ARCA — Ag) Pe — AA — A) Pn + In Ian 
= Vy —[¥m t MmM(A — Xin) Yn — ADA — AL) Sal 


But the bracketed expression belongs to .@,_,. Hence ||A(A;'y,) — 
A(A;,*y,,)|| = dist(y,, 4,-1) > 3- Therefore A(A; 'y,) can have no conver- 
gent subsequence. But A is a compact operator so that if S is any bounded 
subset of %, cl A(S) is compact. Thus it must be that {A> 'y,} has no 
bounded subsequence. Since ||y,|| = 1 for all n, it must be that ||A>*y,|| = 
|A,| 1 > oo. Thatis,O=limA,. 


PROOF OF THEOREM 7.1. The first step is to establish the following. 
7.6. Claim. If X € o(A) and A + 0, then JA is an isolated point of o( A). 


In fact, if {A,} GC o(A) and A, >A, then each A, belongs to either 
o,(A) or o,(A*) (7.3). So either there is a subsequence {A,, } that is 
contained in 0,(A) or there is a subsequence contained in o,(A*). If 
{A,,,} © 6,(A), then Lemma 7.5 implies A,, > 0, a contradiction. If {A,,, } 
C o,(A*), then the fact that A* is compact gives the same contradiction. 
Thus A must be isolated if A # 0. 


7.7. Claim. If A © o(A) and A # 0, then A € o,(A) and dimker(A — A) 
< ©. 


By (7.6), A is an isolated point of o( A) so that E(A) can be defined as in 
(6.9). Let 2, = E(A)# and A, = A|%,. By Exercise 4.9 [also see (4.11)], 
o0(A,) = {A}. Thus A, is an invertible compact operator. By Exercise 
V1I.3.5, dim 2%) < oo. If n = dim 2), then A, — A is a nilpotent operator on 
an n-dimensional space. Thus (A, — A)” = 0. Let v = the positive integer 
such that (4, — A)” =0 but (A, —A)’"' #0. Let x © %, such that 
0 #(A, — A)’ tx = y; then (A — A)y = 0. Thus A € 0,( A). 

Also, ker(A — A) € Lat A and Alker(A — A) is compact. But Ax = Ax 
for all x in ker(A — A), so dimker(A — A) < oo. 

Now for the dénouement. If dim 2 = 00 and A € &,(2), then A cannot 
be invertible (Exercise VI.3.5). Thus 0 € o(A). If A € o( A) and A #0, 
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then Claim 7.7 says that A © o,(A) and dimker(A — A) < 00. So if o( A) is 
finite, either (a) or (b) of (7.1) hold. If o(A) is infinite, then Claim 7.6 
implies that o(A) is countable. So let o(A) = {0,A,,A5,...}. By Lemma 
7.5 and Claim 7.7, (c) holds. Sa 


Part of the following surfaced in the proof of the theorem. 


7.8. Corollary. If A € @)(2%) and \ © o(A) with X # 0, then X is a pole 
of (z — A)~+, ker(A — A) C E(A)&, and dim E(A)& < oo. 


PRooF. The only part of this corollary that did not appear in the preceding 
proof is the fact that ker(A — A) C E(A)&. 

Let A = 0(A)\ {A}, % = E(A)%, Ag = A|%. By Exercise 4.9, 0(A,) 
= A; so A, — AX is invertible on %,. If x € ker(A — A), then x = E(A)x + 
E(A)x. Hence 0 = (A —A)x = (A —A)E(A)x + (A — A)E(A)x = (Ay 
— A)E(A)x + (A, —A)E(A)x. But 2%, and %,€ Lat A, so (A, - 
A)E(A)x © £, and (A, — A)E(A)x © 4; since % M %, = (0), 0 = (Ay 
— A)E(A)x = (A, — A)E(A)x. But A, — A is invertible so E(A)x = 0; 
that is, x = E(A)x © &,. Hence ker(A — A) C 2). | 


If &k is a Volterra kernel (6.14), then V, is a compact operator (Exercise 
VI.3.6) and o(V,) = {0}. So the first possibility of Theorem 7.1 can occur. 
If V is the Volterra operator, then o,(V) = O. 

Let V be the Volterra operator on L7(0,1),1 < p < oo. If A,,...,A, € C€, 
let D: C” >C” be defined by D(z,,...,2,) = (A,2,,...,A,2,). Then 
A=V®@®D on L?(0,1) 6 C” is compact and o( A) = {0,A,,...,A,}. So 
the second possibility of (7.1) occurs. If {A,} CC and limA, = 0, then 
define D: /? > 1? (1 <p < o) by (Dx)(n) =A,x(n). If A =VED on 
L?(Q,1) @/?, A is compact and o( A) = {0,A,,A,...} (see Exercise 3). 

The next result has a number of applications in the theory of integral 
equations. 


7.9. The Fredholm Alternative. Jf A € 4(%), X€C, and X\ #0, then 
ran(A — A) is closed and dimker(A — ) = dimker(A — A)* < o. 


PROOF. It suffices to assume that A © o( A). Put A = 0(A)\ {A}, Z = 
E(A)4, 4, = E(A)#, Ay = A|%,, and A, = A|%. Now X € A = (A, ), 
so A, — A is invertible. Thus ran(A, — A) = %,. Hence ran(A — A) = (A 
—XA)#, + (A — ANB, = ran(A, — A) + &. Since dim 2, < oo, ran(A — 
A) is closed (III.4.3). 

Also note that 


#/ran(A — A) = (4%, + )/[ran( A, -— A) + %| 
= #)/ran(A, — A). 
Since dim 2%, < o, dim[%/ran(A — A)] = dim 2, — dimran(A, — A) = 
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dim ker(A, — A) = dimker(A — A) < 00 since ker(A — A) C 2) (7.8). But 
[%/ran(A — A)]* = [ran(A — A)]~ CIIL.10.2) = ker(A — A)*. Hence 
dim ker(A — A) = dimker(A — A)*. | 


7.10. Corollary. If A <= @ (2%), \€&C, and X #0, then for every y in & 
there is an x in & such that 


7.11 (A-A)x=y 


if and only if the only vector x such that (A — A)x =0 is x = 0. If this 
condition is satisfied, then the solution to (7.11) is unique. 


This corollary is a rephrasing of part of the Fredholm Alternative 
together with the fact that an operator has dense range if and only if its 
adjoint has a trivial kernel. 

The applications of the Fredholm Alternative occur by taking the com- 
pact operator to be an integral operator. 


EXERCISES 


1. If &,M are finite-dimensional subspaces of @ and @< V,M#N, then 
there is a y in Y such that ||y|| = 1 and dist(y, 7) = 1. 


2. Let A € @(#) and let 4,,...,A,, be distinct points in o,(A). If x, © ker(A — 


’ n 


A,),1 <k <n,and x, # 0, show that {x,,...,x,,} 1s a linearly independent set. 
3. Let %,,#5,... be Banach spaces and put #= © 2. Let 4, © @(%,) such 
that sup,||A,|| < oo and define A: 2-2 by A{x,} = {A,x,}. Show that 


A © B(#) and ||A]} = sup, ||4,||. Show that A © #)(#) if and only if each 
A, © @(#&) and lim]|A,,|| = 0. 


4. Suppose A © @(#) and there is a polynomial p such that p(A) © 4 (2). 
What can be said about o( A)? 


5. Suppose A © &(#) and there is an entire function f such that f(A) € #(#). 
What can be said about o( A)? 


6. With the terminology of Exercise 6.13, if A € B@(#), A © o(A), and A # 0, 
what can be said about the index of X? 


§8. Abelian Banach Algebras 


Recall that it is assumed that every Banach algebra is over C. Also assume 
that all Banach algebras contain an identity. 

A division algebra is an algebra such that every nonzero element has a 
multiplicative inverse. It may seem incongruous that the first theorem in this 
section allows the algebra to be nonabelian. However, the conclusion is that 
the algebra is abelian—and much more. 
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8.1. The Gelfand—Mazur Theorem. Jf . is a Banach algebra that is also a 
division ring, then Z= C (= {Al: AE C}). 


ProoF. If a € &, then o(a) # O. If A € o(a), then a — A has no inverse. 
But © is a division ring, so a — A = 0. That is, a = X. es 


As a corollary of the preceding theorem, the algebra of quaternions, H, is 
not a Banach algebra. That is, it is impossible to put a norm on H that 
makes it into a Banach algebra. Can you show this directly? 


8.2. Proposition. Jf & is an abelian Banach algebra and 4 is a maximal 
ideal, then there is a homomorphism h: s—C_ such that M= kerh. 
Conversely, if h: S&C is a nonzero homomorphism, then kerh is a 
maximal ideal. Moreover, this correspondence h > kerh between homomor- 
phisms and maximal ideals is bijective. 


PRooF. If 4 is a maximal ideal, then -Z is closed (2.4b). Hence %/@ isa 
Banach algebra with identity. Let 7: ~”—> W~/M be the natural map. If 
a &@ and z(a) is not invertible in ¥%/Z, then 7(.#a) = r(a)[| V/7] is 
an ideal in »/@ that is proper. Let J= {bE WM: m(b) € r(Va)} = 
a ‘(a(.a)). Then I is a proper ideal of and WCT. Since 7 is 
maximal, # = J. Thus 7(a¥v) C m(1) = 2(@) = (0). That is, 7(a) = 0. 
This says that ~/@ is a field. By the Gelfand—Mazur Theorem »~/// = C 
= {X+4: EC}. Define h: H/M > C by h(A +.@) =D and define h: 
sf > C by h =hoz. Then h is a homomorphism and kerh =. 

Conversely, suppose h: w&—C is a nonzero homomorphism. Then 
kerh = 4 is a nontrivial ideal and »&/4 = C. (Why?) So @ is maximal. 

If h, h’ are two nonzero homomorphisms and kerh = kerh’, then there is 
an a in C such that h = ah’ (A.1.4). But 1 = h(1) = ah’(1) = a, so h = 
h’. i] 


8.3. Corollary. Jf © is an abelian Banach algebra and h: #%—>C is a 
homomorphism, then h is continuous. 


PRooFr. Maximal ideals are closed (2.4b). ei 


The next result improves the preceding corollary a little. Remember that 
by (8.3) if h: s&— C is a homomorphism, then h € x* (the Banach space 
dual of #7). 


8.4. Proposition. Jf & is abelian and h: &— C is a homomorphism, then 
Al] = 1. 


PROOF. By (8.3), h © Y* so that ||h|| < oo. Let a © HM and put A = h(a). 
If jA| > |la|j, then ||a/A|| < 1. Hence 1 — a/A is invertible. Let b = (1 — 
a/\)', so 1 = b(1 — a/A) = b — ba/X. Since h(1) = 1, 1 = h(b — ba/d) 
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= h(b) — h(b)h(a)/A = h(b) — h(b) = 0, a contradiction. Hence |a|| > 
|A| = |h(a)|; so ||A|| < 1. Since AQ) = 1, |lAl| = 1. Ei 


8.5. Definition. If 27 is an abelian Banach algebra, let + = the collection 
of all nonzero homomorphisms of ~%— C. Give » the relative weak* 
topology that it has as a subset of ../*. & with this topology is called the 
maximal ideal space of &. 


8.6. Theorem. Jf © is an abelian Banach algebra, then its maximal ideal 
space & is a compact Hausdorff space. Moreover, if a © &%, then o(a) = 


3(a) = (h(a): h € 3}. 


PROOF. Since & C ball ./%*, it suffices for the proof of the first part of the 
theorem to show that 2 is weak* closed. Let {h,} be a net in & and suppose 
h & ball ~#* such that h,;>h weak*. If a,be xv, then h(ab) = 
lim ,h (ab) = lim ,h,(a)h,(b) = h(a)h(b). So h is a homomorphism. Since 
hQ) = lim,h,(1) = 1, h € 2. Thus 2 is compact. 

If h& + and A = h(a), then a — A € kerh. So a — X 1s not invertible 
and A € a(a); that is, X(a) € o(a). Now assume that A € o(a); soa—A 
is not invertible and, hence, (a — A)# is a proper ideal. Let @ be a 
maximal ideal in . such that (a -A) WCW. If h © XY such that Z= 
kerh, then 0 = h(a — A) = h(a) — A; hence o(a) € 2(a). a 


Now it is time for an example. Here is one that is a little more than an 
example. If X is compact and x € X, let 6.: C(X)—C be defined by 
5.(f) = f(x). It is easy to see that 6, is a homomorphism on the algebra 
C(X). 


8.7. Theorem. Jf X is compact and = is the maximal ideal space of C(X), 
then the map x > §, is a homeomorphism of X onto &. 


ProoF. Let A: X — 2 be defined by A(x) = 6,. As was pointed out before, 
A(X) ¢€ &. It was shown in Proposition V.6.1 that A: X — (A(X), weak*) 
is a homeomorphism. Thus it only remains to show that A(X) = 2. If 
h © S&S, then there is a measure p in M(X) such that h(f) = {fdp for all f 
in C(X). Also, ||u|| = ||2]| = 1 and p(X) = fldp = h(1) = 1. Hence p = 0 
(Exercise III.7.2). Let x © support (). It will be shown that h = 6,. 

Let Z= {fe cx): f(x) = 0}. So 4 is a maximal ideal of C(X). 
Note that if it can be shown that kerh CZ, then it must be that kerh =.4@ 
and so h = 6,. So let f € kerh. Because kerh is an ideal, |f|* = ff < kerh. 
Hence 0 = hA(|f|7) = /|f|? du. Since 4 > 0 and |f|* > 0, it must be that 
f= 0 ae. [pw]. Since f is continuous, f = 0 on support (p). In particular, 
f(x) = 0 and so fE%. a 


It follows from the preceding theorem that the maximal ideals of C(X) 
are all of the form { f © C(X): f(x) = 0} for some x in X. 
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8.8. Definition. Let «/ be an abelian Banach algebra with maximal ideal 
space 2. If a © &W, then the Gelfand transform of a is the function 4a: 
& — C defined by a(h) = h(a). 


8.9. Theorem. /f . is an abelian Banach algebra with maximal ideal space 
x anda & &, then the Gelfand transform of a, a, belongs to C(2). The map 
a> @ of & into C(2) is a continuous homomorphism of # into C(d) of 
norm 1 and its kernel is 


N{ 4: M is a maximal ideal of x}. 
Moreover, for each a in &, 
a||_, = lim |Ja”|)!7”. 
n— co 


Proor. If h; > h in 2, then h; > h weak* in #*. So if aE ¥#, a(h;) = 
h(a) > h(a) = a@(h). Thus @ € C(4X). 7 

Define y: YW C(2) by y(a) = 4. If a,b € A, then y(ab)(h) = ab(h) 
= h(ab) = h(a)h(b) = (h)b(h). Therefore y(ab) = y(a)y(b). It is easy 
to see that y is linear, so y is a homomorphism. Also, by (8.4), if a € &%, 
|a(h)| = |h(a)| < |lal|; thus |ly(a@)ll, = Il4ll. < llall. So y is continuous 
and ||y|| < 1. Since y(1) = 1, |]y|| = 1. 

Note that a € kery if and only if a = 0; that is, a € kery if and only if 
h(a) = 0 for each h in . Thus a € ker y if and only if a belongs to every 
maximal ideal of 7. 

Finally, by Theorem 8.6, if a © %, then ||4||,, = sup{|A|: A © o(a)}. The 
last part of this theorem is thus a consequence of this observation and 
Proposition 3.8. i 


The homomorphism a> 4 of # into C(2%) is called the Gelfand trans- 
form of &. The kernel of the Gelfand transform is called the radical of 
WA ,rad WM. So 


rad Y= ){ 4: M is a maximal ideal of 7}. 


If X is compact and 2, the maximal ideal space of C(X), is identified 
with X as in Theorem 8.7, then the Gelfand transform C(X) > C(2) 
becomes the identity map. 

If 7 is an abelian algebra, say that a in & isa generator of # if { p(a): 
p is a polynomial} is dense in .~. 

Recall that if +: X — Y is a homeomorphism, then A: C(Y) > C(X) 
defined by Af = fer is an isometric isomorphism (VI.2.1). Denote the 


relationship between A and 7 by A = 7%. 


8.10. Proposition. [f x is an abelian Banach algebra with identity and a is a 
generator of &, then there is a homeomorphism +t: & — o(a) such that if y: 
wf — C(2) is the Gelfand transform and p is a polynomial, then y( p(a)) = 
T™(p). 
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PROOF. Define t: + — o(a) by t(h) = h(a). By Theorem 8.6 7 is surjec- 
tive. It is easy to see that 7 is continuous. To see that 7 1s injective, suppose 
T(h,) = t(h), so h(a) = h,(a). Hence h,(a”) = h,(a") for all n > 0. By 
linearity, h,( p(a)) = h,( p(a)) for every polynomial p. Since a is a genera- 
tor for and h, and h, are continuous on &, h, =h,, and 7 is injective. 
Since x is compact, 7 is a homeomorphism. 

The remainder of the proposition follows from the fact that y and 7* are 
homomorphisms. Hence y(p(a))(A) = p(y(a))(h) = p(ay(h) = p(a(h)) = 
p(t(h))=7*(pyh). of 


8.11. Corollary. Jf x has two generators, a, and a,, then o(a,) and o(a,) 
are homeomorphic. 


The converse to (8.11) is not true. If “w”= C[—1,1], then f(x)=x 
defines a generator f for o. If g(x) = x’, then o(g) = g({[—1,1)) = [0,1]. 
So o( f) and o(g) are homeomorphic. However, g is not a generator for ~. 
In fact, the Banach algebra generated by g consists of the even functions in 
C[—1, 1]. 


8.12. Example. If V: L°(0,1) > L7(0,1) is the Volterra operator and # is 
the closure in @( L7(0,1)) of { p(V): p is a polynomial in z}, then is an 
abelian Banach algebra and rad w= cl{ p(V): p is a polynomial in z and 
p(0) = 0}. In other words, . has a unique maximal ideal, rad .~. In fact, if 
B= B(L*(0,1)), Theorem 5.4 implies that do,(V) C og(V) Co (V). 
Since og(V ) = {0} (6.14), o,(V) = {0}. The statement above now follows 
by Proposition 8.10. 


8.13. Example. Let .° be the closure in C( 0D) of the polynomials in z. If 
> is the maximal ideal space of .»/, then » is homeomorphic to o,(z). 
(Here z is the function whose value at A in 0D is A.) Nowo,(z) = clD as 
was shown in Example 5.1. If f € #, then the Maximum Modulus Theorem 
shows that f has a continuous extension to clD that is analytic in D [see 
(5.1)]. Also denote this extension by f. The proof of (8.10) shows that the 
continuous homomorphisms on .~ are of the form f +> f(A) for some A in 
clD. 


In the next section the Banach algebra L'(G) is examined for a locally 
compact abelian group and its maximal ideals are characterized. 


EXERCISES 


1. Let »@ be a Banach algebra with identity and let J be the smallest closed 
two-sided ideal of # containing {xy — yx: x,y €#}. J is called the commu- 
tator ideal of ®&. (a) Show that »/J is an abelian Banach algebra. (b) If J is a 
closed ideal of . such that »//I is abelian, then J DJ. (c) If hi: W>Cisa 
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10. 


11. 
12. 


homomorphism, then J C kerh and h induces a homomorphism h: </J > C 
such that hoa = h, where 7: x &/J is the natural map. Hence ||h|| = 1. (d) 
Let = be the set of homomorphisms of »#— C and let © be the set of 
homomorphisms of »/J. Show that the map hh defined in (c) is a 
homeomorphism of = onto 2. 


. Using the terminology of Exercises 2.6 and 2.7, let & be an abelian Banach 


algebra without identity and show that if M is a maximal modular ideal, then 
there is a homomorphism h: <-> C such that = kerh. Conversely, if h: 
Sf — € is a nonzero homomorphism, then kerh is a maximal modular ideal. 
Moreover, the correspondence h*+> kerh is a bijection between homomor- 
phisms and maximal modular ideals. 


. If # is an abelian Banach algebra and h: »&— C is a homomorphism, then h 


is continuous and ||/|| < 1. If « has an approximate identity {e,} such that 
lle;|| < 1 for all i, then |jh|| = 1 (see Exercise 2.8). 


. Let be an abelian Banach algebra and let & be the set of nonzero 


homomorphisms of .-— C. Show that » is locally compact if it has the relative 
weak* topology from .* (Exercise 3). 


. With the notation of Exercise 4, assume that & has no identity and let », be 


the algebra obtained by adjoining an identity. For a in , let o(a) be the 
spectrum of a as an element of ., and show that o(a) = {h(a):h © 2} U {0}. 
Also, show that the maximal ideal space of ~,, ,, is the one-point compactifi- 
cation of 2. 


. With the notation of Exercise 4, for each a in # define 4: 2 > C by 


a(h) = h(a). Show that a4 © C)(“) and the map a> 4 of W into C,(2) isa 
contractive homomorphism with kernel = {\{.4@: -@ is a maximal modular ideal 
of #7}. 


. If X is locally compact, show that x > 6, is a homeomorphism of X onto the 


maximal ideal space of C,( X). 


. Let X be locally compact and for each open subset U of X let C(U) = {fe 


Co(X): f(x) = 0 for x in X\ U}. Show that C)(U) is a closed ideal of C)( X) 
and that every closed ideal of C)(X) has this form. Moreover, the map 
U- © (U) is a lattice isomorphism from the lattice of open subsets of X onto 
the lattice of ideals of C)(X). 


. With the notation of the preceding exercise, show that C,(U) is a modular ideal 


if and only if X \ U is compact. 


If sf is an abelian Banach algebra and a € W, say that a is a rational generator 
of #¥ if { f(a): f is a rational function with poles off o(a)} is dense in &. 
Show that if a is a rational generator of ., then x is homeomorphic to o(a). 


Verify the statements made in Example 8.12. 


Say that a,,...,a, are generators of # if # is the smallest Banach algebra 
with identity that contains {a,,...,a,,}. Show that a,,...,a, are generators of 
Sf if and only if w= cl{ p(a,...,a,): p is a polynomial in n complex 
variables z,,...,Z, }, and if 2 is the maximal ideal space, then there is a 
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homeomorphism 7 of 2 onto a compact subset K of C” such that if p is a 
polynomial in n variables, then y( p(a,,...,4@,)) = T*(p). 


13. Verify the statements made in Example 8.13. 


14. (Zelazko [1968].) Let . be a Banach algebra and suppose $: w- C is a linear 
functional such that $(a*)=$(a)’ for all a in &. Show that @ is a 
homomorphism. 


15. Let # be an abelian Banach algebra with identity that is semisimple [that is, 
rad ¥ = (0)]. If || - || is the norm on w and || - ||, is another norm on # that 
also makes & into a Banach algebra, then these two norms are equivalent. 
(Hint: use the Closed Graph Theorem to show that the identity map 7: 
(H, || - ||) > (%, || - |],) is continuous.) 


16. Let & be as in Example 8.13 and let K = {¢ € #*: (1) = |||| = 1}. Show 
that ext K = {6,: |z| = 1}. (See (V.7).) 


17. Show that f(x) = exp(vix) is a generator of C((0,1]) but g(x) = exp(27/x) 1s 
not. 


§9*. The Group Algebra of a Locally Compact 
Abelian Group 


If G is a locally compact abelian group and m is Haar measure on G, then 
L(G) = L(m) is a Banach algebra (Example 1.11), where for f, g in L'(G) 
the product f * g is the convolution of f and g: 


fe a(x) = | f(y )a(y) ay, 


Note that dy is used to designate integration with respect to m rather than 
dm(y). Because G is abelian, L'(G) is abelian. In fact, g*f(x)= 
fe(xy~')f(y) dy. If y~'x is substituted for y in this integral, the value of 
the integral does not change because Haar measure is translation invariant. 
Hence g * f(x) = {g(y) f(y" 'x) dy = fa(y) f(xy") dy = f * B(x). 

Let e denote the identity of G. If G is discrete, then 6, € L’(G) and 6, is 
an identity for L1(G). If G is not discrete, then L'(G) does not have an 
identity (Exercise 1). 

Some examples of nondiscrete locally compact abelian groups are R” and 
T”, where T = the unit circle 0D in C with the usual multiplication. Note 
that T~ is also a compact abelian group while R®@ fails to be locally 
compact. The Cantor set can be identified with the product of a countable 
number of copies of Z, and is thus a compact abelian group. Indeed, the 
product of a countable number of finite sets (with the discrete topology) 1s 
homeomorphic to the Cantor set, so that the Cantor set has infinitely many 
nonisomorphic group structures. 
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For a topological group G, L'(G) is called the group algebra for G. If G 
is discrete, the algebraists talk of the group algebra over a field K as the set 
of all f= 21,-¢a,g, where a, € K and a, + 0 for at most a finite number 
of g in G. If K =C, this is the set of functions f: G —>C with finite 
support. Thus in the discrete case the group algebra of the algebraists can be 
identified with a dense manifold in L'(G) = /'(G). 

Unlike §V.11, if f: GC and x €G, define f.: G—>C by f(y) = 
f(yx7}); so f(y) = f(x7'y) for G abelian. We want to examine the 
function x > f, of G > L?(G), 1 < p < o. To do this we first prove the 
following (see Exercise V.11.10). 


9.1. Proposition. If G is a topological group and f: G — C is a continuous 
function with compact support, then for any « > 0 there is a neighborhood U of 
e such that |f(x) — f(y)| < € whenever x~"y € U. 


Proor. Let &@ be the collection of open neighborhoods U of e such that 
U = U“'. Note that if V is any neighborhood of e, than U=VOAV'Eey 
and U Cc V. Order Y by reverse inclusion. 

Suppose the result is false. Then there is an e > 0 such that for every U in 
W there are points xy, yy in G with xj yy in U and |f(xy) — f(vy)| = &. 
Note that either x, or y, © K = support f. Since U= U~!, we may 
assume that x, © K for every U in Y. Now {x,: UE Y} is a net in K. 
Since K is compact, there is a point x in K such that x, > x. But 
xy'yy > e. Since multiplication is continuous, y, = xy(xjlyy) >. 
Therefore if W is any neighborhood of x, there is a U in Y with 
Xy, Yy © W. But f is continuous at x so W can be chosen such that 
f(x) — f(w)| < €/2 whenever w © W. With this choice of W, |f(x,) — 
f(yy)| < €, a contradiction. a 


One can rephrase (9.1) by saying that continuous functions on a topologi- 
cal group that have compact support are uniformly continuous. 

In the next result it is the case p = 1 which is of principal interest for us 
at this time. The proof of the general theorem is, however, no more difficult 
than this special case. 


9.2. Proposition. Jf G is a locally compact group, 1 < p< «, and fe 
L?(G), then the map x > f, is a continuous function from G into L?(G). 


PROOF. Fix f in L?(G), x inG, and e > 0; it must be shown that there is a 
neighborhood V of x such that for y in V, ||f, — fll, < ¢. First note that 
there is a continuous function ¢: G —> C having compact support such that 
lf — $l, < €/3. Let K = spt $. Note that because Haar measure is transla- 
tion invariant, for any y in G, |lf, — %]l, = IIf— ll, < €/3. Now by 
Proposition 9.1, there is a neighborhood U of e such that |¢(y) — ¢(w)| 
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< 1e{[2m(K )]~‘/” whenever y~'w & U. Put V = Ux. If y € V, then 


Is, — Ol = flo(ey) — o(zx- Pade. 


But y = ux for some u in U, so (zy ')~ (zx!) = yx”! =u &€ U. Thus 


— o IP = a — (2x7!) dz 

a [2m(K)] ‘m( Ky U Kx) 

<(5) 

Therefore if y € V, ||f. — fill, < Ite — Gell, + lox — Ol, + Ilo, — All, < & 
|| 


The aim of this section is to discuss the homomorphisms on L'(G) when 
G is abelian and to examine the Gelfand transform. There is a bit of a 
difficulty here since L'(G) does not have an identity when G is not discrete. 
If 5, is the unit point mass at e, then 6, is the identity for M(G) and hence 
acts as an identity for L'(G). Nevertheless 6, ¢ L'(G) if G is not discrete. 
All is not lost as L(G) has an approximate identity (Exercise 2.8) of a nice 


type. 


9.3. Proposition. If f € L'(G) and e > 0, then there is a neighborhood U of 
e such that if g is a non-negative Borel function on G that vanishes off U and 
has {g(x) dx = 1, then ||f — f * gil, < «. 


PROOF. By the preceding proposition, there is a neighborhood U of e such 
that ||f— fill, <« whenever y © U. If g satisfies the conditions, then 


f(x) — f *9(x) = fUf(x) -— fy Ig(y) dy for all x. Thus, 


If -f* glh = f\f reo — f(xy") g(y) dy) dx 
< J sv) ful) — f(xy") | dx dy 


= f 8S flay 
< €. b= 


9.4. Corollary. There is a net {e,;} of non-negative functions in L'(G) such 
that fe,dm = 1 for alli and \\e,* f — fil, > 0 for all f in L(G). 


PROOF. Let Y be the collection of all neighborhoods of e and order Y by 
reverse inclusion. Let Y= {U,: i © 1} where i <7 if and only if U;C U.. 
For each i in J put e, = m(U,)" "xy, soe, 2 Oand fe,dm = 1.If fe L(G) 
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and e > 0, let U, be as in the preceding proposition. So if j > i, e, satisfies 
the conditions on g in (9.3) and hence ||/f — f * e,||, < «. z 


9.5. Corollary. If h: L\(G)— C is a nonzero homomorphism, then h is 
bounded and |\h|| = 1. 


PROOF. The fact that A is bounded and ||h|| < 1 is Exercise 8.3. In light of 
the preceding corollary if h(f) # 0, h(f) = limA(f *e,) = A(f lim h(e,). 
Hence h(e,) > 1. Since |le,|| = 1 for all 7, ||A|| =1. 


Even though Haar measure on most of the popular examples is o-finite, 
this is not true in general. For example, if D is an uncountable discrete 
group, the Haar measure on D is counting measure and, hence, not o-finite. 
Similarly, Haar measure on D X R is not o-finite. Nevertheless, it is true 
that L'(G)* = L~(G) for any locally compact group because (G, m) is an 
example of a decomposable measure space. This fact will be assumed here. 
The interested reader can consult Hewitt and Ross [1963]. 


9.6. Theorem. Jf G is a locally compact abelian group and y: G>T is a 
continuous homomorphism, define f(y) by 


9.7 fly) = [£)vo) x 


for every f in L(G). Then f > fly) is a nonzero homomorphism on L(G). 
Conversely, if h: L'(G) > € is a nonzero homomorphism, there is a continu- 
ous homomorphism y: G — TV such that h(f) = f(y). 


PROOF. First note that if y: G — T is a homomorphism, y(xy) = y(x)y(y) 
and y(x_') = y(x)"' = y(x), the complex conjugate of y(x). If f,g © 
L(G), then 


fe aly) = [(f*g)(x)y(e!) dx 


= fr(x-?) [f(xy 8(y) ay ax 
= fel») fi (xy) y((xy ((2y"2)"") ae] a 


But the invariance of the Haar integral gives that [f(xy~')y((xy~!)~') dx 
= {f(x)y(x~') dx. Hence 


Fear) = [a(vdv0-)| [fdr ax| dy = FAC. 


So fr f(y) is a homomorphism. Since y is continuous and y(G) C T, 
y © LG) and |ly||,, = 1. Thus f > fly) is not identically zero. 

Now assume that h: L'(G) > C is a nonzero homomorphism. Since h is 
a bounded linear functional, there is a @ in L*(G) such that A(f) = 
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[f(x)o(x)dx and ||9$/|,, = |All = 1. If f.g @ L(G), then h(f*g)= 
[(f * B\(x)o(x) dx = [a(yL [fy *)o(x) dx] dy = [a(y)h(f,) dy. [Note 
that y+» A(f,) is continuous scalar-valued function by Proposition 9.2.] 
But h(f*g) = h(f)h(g) = fg(v)h(f)o(y) dy. So 


0= fe(y)[A(L,) — aN o(y)] & 


for every g in L'(G). But y > h(f,) — h(f)o(y) belongs to L(G), so for 
any f in L(G), 


9.8 h(f,) = h(f)o(y) 


for almost all y in G. Pick f in L‘(G) such that h(f) #0. By (9.8), 
o(y) = ACf,)/h(f) ae. But the right-hand side of this equation is continu- 
ous. Hence we may assume that ¢ is a continuous function. Thus for every f 
in L'(G), (9.8) holds everywhere. 

In (9.8), replace y by xy and we obtain h(f)o(xy) = A(f,,) = AC /),)- 
Now replace f in (9.8) by f, to get A(f.)$(y) = A(f,,). Thus h(f)o(xy) 
= h(f.)o(y) = [ACf)o(x)]o(y). If AC f) #0, this implies (xy) = 
6(x)o(y) for all x, y in G. Thus ¢: G—>C is a homomorphism and 
lo(x)| <1 for all x. But 1 = 6(e) = $(x)o(x~') = o(x)o(x) | and 
ld(x)|, |6(x)~1| < 1. Hence |@(x)| = 1 for all x in G. If y(x) = o(x"'), 
then y: G > T is a continuous homomorphism and h(f) = f(y) for all f 
in L(G). s 


Let = be the set of nonzero homomorphisms on L'(G), where G is 
assumed to be abelian (both here and throughout the rest of the chapter). So 
x C ball L'(G)*. If h € ball L'(G)* and {h,} is a net in 2 such that 
h, — h weak*, then it is easy to see that / is multiplicative. Thus the weak* 
closure of 2 C 2 U {0}. Hence the relative weak* topology on & makes 2 
into a locally compact Hausdorff space (see Exercise 8.4). 

Let I = all the continuous homomorphisms y: G — T. By Theorem 9.6, 
= and I can be identified using formula (9.7). In fact, the map defined in 
(9.7) is the Gelfand transform when this identification is made. (Just look at 
the definitions.) Since & and I are identified and 2 has a topology, I’ can 
be given a topology. Thus I’ becomes a locally compact space with this 
topology. (For another description of the topology, see Exercise 6.) The 
functions in I are called characters and are sometimes denoted by I = G 
and called the dual group. 

Also notice that in a natural way I’ is a group. If y,,y, <@ J, then 


(¥1Y2 (x) = ¥1(X) ¥2(x) and NY. € I. 
9.9. Proposition. I" is a locally compact abelian group. 


Clearly I is an abelian group and we know that I is a locally compact 
space. It must be shown that I" is a topological group. To do this we first 
prove a lemma. 
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9.10. Lemma. 


(a) The map (x,y) y(x) of G X I > T is continuous. 
(b) If {y;} is anetin I and y,; > y in T, then y,(x) > y(x) uniformly for x 
belonging to any compact subset of G. 


ProoF. First note that if x € G and f € L(G), then for every y in I, 


Rly) = [A&(v)r(o-") &y 
= [fox )y7!) & 


= [flz)y(e'e"!) 
= y(x"")f(y). 


So if y, > y in I and x; > x inG, 
f(y wlx) — fy = ati) — £2) 
= Ae (¥%) — f(y) si e-(Y;) — f,-(y)I. 


But |A-(y,) — f(y < Wher — fet 20 by (9.2). Because f,-1 
L(G), f(y) > f(y) since y;, > y. Thus f(y,)¥,(x,;) > f(y)y(x). If f 
is chosen so that f(y) # 0, then because f(y) — f(y), there is an in such 
that fy) # 0 for i > iy. Therefore y,(x;) > y(x) and (a) 1s proven. 

Now let K be a compact subset of G and let {y,} be a net in I’ such that 
Y; > Yo- Suppose {y,;(x)} does not converge uniformly on K to y)(x). Then 
there is an e > 0 such that for every i, there is a 7, > i and an x, in K such 
that ly;,(%1) — Yo(%;)| = &. Now {y; } is a net and Y;, > Yo (Exercise). Since 
K is compact, there is an x, in K such that x, -7> Xx ,. Now part (a) 
implies that the map (x,y) > (y(X), Yo(x)) of G x I into T X T is con- 
tinuous. Since (x,,7¥,) ~a (%oY%) In GX TT, (y,(%;), Yo(%))) ~ar 
(Yo(Xo), Yo(Xo)). So for any io, there is an 7 > ig such that |y,;(x;) — Yo(Xo)| 
< €/2 and |¥o(x;) — Yo(%o)| < €/2. Hence |y,(x;) — yo(x))| < & a con- 
tradiction. a 


PROOF OF PROPOSITION 9.9. Let {y,;},{A,;} be nets in I such that y, > y 
and A, — X. It must be shown that y,A;' > yA7~'. Let ¢ © CG) and put 
K= spt ¢. Then $(y,A;') = feblxy(x7DA, (x) dx. By the preceding 
lemma, y,(x~') > y(x7 1) and A,(x) > A(x) uniformly for x in K. Thus 
Gk) > $(yA7~'). If fe L(G) and e > 0, let ¢ € C(G) such that 
If — $||, < €/3. Then 


FAs) — FA) < + OAT) — 80). 


It follows that f(y,A;!) > f(yA7!) for every f in L'(G). Hence y,A7! 
yA lin I. Fal 
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Since I is a locally compact abelian group, it too has a dual group. Let I 
be this dual group. If x € G, define p(x): I > T by p(x)(y) = y(x). It is 
easy to see that p is a homomorphism. It is a rather deep fact, entitled the 
Pontryagin Duality Theorem, that p: G > I’ is a homeomorphism and an 


isomorphism. That is, G “is” the dual group of its dual group. The 
interested reader may consult Rudin [1962]. We turn now to some examples. 


9.11. Theorem. /f y € R, then y,(x) = e'*” defines a character on R and 
every character on R has this form. The map y > y, is a homeomorphism and 
an isomorphism of R onto R. If y © R and f € L'(R), then 


9.12 f(y) =flv) = fo flee?” dx, 


the Fourier transform of f. 


Proor. If yER, then ly,(%)| = 1 for all x and y,(x, + x2) = 
¥(%1)¥,(%2). So y, € R. Also, y, .,,(%) = y,,(%)y,,(x). Hence y > y, isa 
homomorphism of R into R. 

Now let y € R. y(0) = 1 so that there is a 6 > 0 such that [Oy(x) dx = a 
# 0. Thus 


ay(x) = r(x) f "y(t) dt 
= [re + t) dt 


= [vat 


Hence y(x) = a~}{2*° y(t) dt. Because y is continuous, the Fundamental 
Theorem of Calculus implies that y is differentiable. Also, 


v(x + t) =x) (x) 74= 1 | 


So y(x) = y(0)y(x). Since y(0) = 1 and |y(x)| = 1 for all x, the elemen- 
tary theory of differential equations implies that y = y, for some y in R. 
This implies that y > y, is an isomorphism of R onto R. 

It is clear from (9.7) that (9.12) holds. From here it is easy to see that 
y > y, is a homeomorphism of R onto R. = 


So the preceding result says that R is its own dual group. Because of 
(9.12), the function f as defined in (9.7) is called the Fourier transform of f. 
The next result lends more weight to the use of this terminology. 


9.13. Theorem. [fn € Z, define y,: T > T by y,(z) = 2". Then y, © T 
and the map n +> y,, is a homeomorphism and an isomorphism of Z onto V. If 
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n & Z and f € L\(1), then 
9.14 Fm) = Fn) = 55 [Hele a8. 


Proor. It is left to the reader to check that y, @ T and ny, is an 
injective homomorphism of Z into T. If y € T, define o: R > T by 
o(t) = y(e”); it follows that o € R. By (9.11), o(t) = e’”' for some y in R. 
But o(f + 277) = o(t), so e?”” = 1. Hence y = n € Z. Thus y(e’’) = 0(8) 
= eft? y = Y,, and n +> y, is an isomorphism of Z onto T. Formula (9.14) 
is immediate from (9.7). The fact that n > y, is a homeomorphism is left as 


an exercise. a 
So T = Z, a discrete group. This can be generalized. 


9.15. Theorem. If G is compact, G is discrete; if G is discrete, G is compact. 


Proor. Put I = G. If G is discrete, then L'(G) has an identity. Hence its 
maximal ideal space is compact. That is, I’ is compact. 

Now assume that G is compact. Hence [Ic L'(G) since m(G) = 1. 
Suppose y © I and y # the identity for I’, then there is a point x, in G 
such that y(x,) # 1. Thus 


r(x) dx = [r(x5 xo) dx 
= (x0) fr(xxo") dx 


= (x0) fr(x) dx, 


since Haar measure is translation invariant. Since y(x,) # 1, this implies 
that 


[(s) d& = 0 ify #1. 
G 


Of course if y = 1, f{ldx = m(G) =1. So if f=1 on G, fe L(G) and 
fly) = fy(x~*) dx = x a)(7). Since f is continuous on I, {1} is an 
open set. By translation, every singleton set in I’ is open and hence I is 
discrete. x 


9.16. Theorem. [fa <€T, define y,: Z > T by y,(n) =a". Then y, € Z 
and the map a +> y, is a homeomorphism and an isomorphism of V onto Z. If 
a € T and f € LZ) =1'(Z), then 


9.17 ij O= Sia 


n=—0 
ProoF. Again the proof that a > y, is a monomorphism of T > Z is left to 
the reader. If y € Z, let y1) =a € T. Also, y(”) = y(1)” = a", so y = y,. 
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Hence a> y, is an isomorphism. It is easy to show that this map is 
continuous and hence, by compactness, a homeomorphism. a 


For additional reading, consult Rudin [1962]. 


EXERCISES 


1. 
2. 


10. 


11. 


Prove that if L'(G) has an identity, then G is discrete. 


If fe L*(G), show that xf, is a continuous function from G into 
(L”(G), wk*). 


. Is there a measure » on R different from Lebesgue measure such that for f in 


L'(n), x  f, is continuous? Is there a measure for which this map is discon- 
tinuous? 


. If f € G(G), show that x > f, is a continuous map from G > ©)(G). 


. If fe L*(G) and f is uniformly continuous on G, show that x > f, is a 


continuous function from G — L®(G). Is the converse true? 


. If K is a compact subset of G, y © I, and e > 0, let U(K,y, 6) = {y EL: 


ly(x) — Yo(x)| < e for all x in K }. Show that the collection of all such sets is a 
base for the topology of I’. (This says that the topology on I" is the compact-open 


topology.) 


. Show that there is a discontinuous homomorphism y: R > T. If y: R > T isa 


homomorphism that is a Borel function, show that y is continuous. 


. If G is a compact abelian group, show that the linear span of I’ is dense in 


C(G). 


. If G is a compact abelian group, show that I’ forms an orthonormal basis in 


L?(G). 
If G is a compact abelian group, show that G is metrizable if and only if I’ is 


countable. 


Let {G,} be a family of compact abelian groups and G = II,G,. If I, = G.. 
show that the character group of G is {{y,} © I],I,: y, = e except for at most 
a finite number of a}. 


CHAPTER VIII 
C*-Algebras 


A C*-algebra is a particular type of Banach algebra that is intimately 
connected with the theory of operators on a Hilbert space. If # is a Hilbert 
space, then @4(#) is an example of a C*-algebra. Moreover, if 7 is any 
C *-algebra, then it is isomorphic to a subalgebra of #(#) (see Section 5). 
Some of the general theory developed in this chapter will be used in the next 
chapter to prove the Spectral Theorem, which reveals the structure of 
normal operators. 

A more thorough treatment of C*-algebras is available in Arveson [1976] 
or Sakai [1971]. 


§1. Elementary Properties and Examples 


If x is a Banach algebra, an involution is a map a > a* of # into Y such 
that the following properties hold for a and b in & and a in C: 

(1) (a*)* = a; 

(11) (ab)* = b*a*; 
(i1) (aa + b)* = aa* + b*. 

Note that if . has involution and an identity, then 1* - a = (1* - a)** 

= (a* - 1)* = (a*)* = a; similarly, a - 1* = a. Since the identity is unique, 
* = |. Also, for any a in C, a* = a. 


1.1. Definition. A C*-algebra is a Banach algebra .% with an involution 
such that for every a in X%, 


a 2 
||a*al| = |la||*. 
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1.2. Example. If #7 is a Hilbert space, W~@= &(#) is a C*-algebra where 
for each A in @(4#), A* = the adjoint of A. (See Proposition II.2.7.) 


1.3. Example. If # is a Hilbert space, 4)(#) 1s a C*-subalgebra of 
B(#), though #)(#) does not have an identity. 


1.4. Example. If X is a compact space, C(X) is a C*-algebra where 
f *(x) = f(x) for f in C(X) and x in X. 


1.5. Example. If (X,0,) is a o-finite measure space, L*(X,0, uw) is a 
C *-algebra where the involution is defined as in (1.4). 


1.6. Example. If X is locally compact but not compact, C)(X) 1s a 
C *-algebra without identity. 

1.7. Proposition. If 7 is a C*-algebra and a © &, then ||a*\| = |lal|. 
PROOF. Note that |/a||* = ||a*a|| < ||a*||||a||; so |ja|| < ||a*||. Since a = a**, 


< 
substituting a* for a in this inequality gives ||a*|| < |lalj. = 


1.8. Proposition. If x is a C*-algebra and a © &, then 


||a|| = sup{||ax|]: x ©, ||x|| < 1} 


sup{||xal|: x © 2, ||x|] < 1). 


ProoF. Let a = sup{||ax||: x € &, ||x|| < 1}. Then ||ax|| < |Ja||||x|| for any 
x in &; hence a < jlal|. If x = a*/|lal|, then ||x|| = 1 by the preceding 
proposition. For this x, ||ax|| = ||a||, so a = |la||. The proof of the other 
equality is similar. fs) 


This last proposition has an alternate formulation that is useful. If 
a&W, define L,; P-L by L(x) = ax. By (1.8), L, © B(M) and 
iL || = llall. If p: 2% B(x) 1s defined by p(a)=L,, then p is a 
homomorphism and an isometry. That is, . is isometrically isomorphic to 
a subalgebra of @(.%). The map p is called the /eft regular representation 
of XH. 

The left regular representation can be used to discuss the process of 
adjoining an identity to .. Since is isomorphic to a subalgebra @( 7) 
and @(.%) has an identity, why not just look at the subalgebra of B(x) 
generated by . and the identity operator? Why not, indeed. This is just 
what is done below. 


1.9. Proposition. If. is a C*-algebra, then there is a C*-algebra &, with 
an identity such that %, contains @ as an ideal. If x does not have an 
identity, then ,/& is one dimensional. If © is a C*-algebra with identity, 
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and v: Z—> FE is a *-homomorphism, then v,: Y%, — ©, defined by v,(a + 
a) = vp(a) +a forain Mand ain C, is a *-homomorphism. 


PROOF. It may be assumed that .% does not have an identity. Let ~, = {a 
+a: a€ P%,a = C} (a+ a is just a formal sum). Define multiplication 
and addition in the obvious way. Let (a + a)* = a* + @ and define the 
norm on x, by 


la + al] = sup{|lax + ax||: x ©, ||x|| <1}. 


Clearly, this is a norm on .%,. It must be shown that ||y*y|| = |ly||? for 
every y in &,. 

Fix a in Y and ain C. If e > 0, then there is an x in o& such that 
|x|| = 1 and 


ja + all? — & < |lax + ax||* = ||(x*a* + ax*)(ax + ax)|| 
= ||x*(a + a)*(a + a)x|| < ||(a + a)*(a t+ a). 


Thus |ja + all? < |\((a + a)*(a + a)||. 

It is left to the reader to prove that the norm on .», makes x, a Banach 
algebra. For the other inequality, note that ||(a + a)*(a + a)|| < ||(a + 
a)*||||a + al|. So the proof will be complete if it can be shown that 
(a + a)*|| < |la + all. Now if x, y © and |Ixll, lly] <1, then |ly(a + 
a)*x|| = |lya*x + ayx|| = |lx*ay* + ax*y*|| = ||x*(a + a)y*|| < lla + all. 
Taking the supremum over all such x, y gives the desired inequality. 

It remains to prove the statement concerning the *-homomorphism »p. 
Note that being a *-homomorphism means, besides being an algebra 
homomorphism, that v(a*) = v(a)*. The details are left to the reader. 

i 


If Y is a C*-algebra with identity and a € x#, then o(a), the spectrum 
of a, is well defined. If . does not have an identity, o(a) is defined as the 
spectrum of a as an element of the C*-algebra ., obtained in Proposition 
1.9. 


1.10. Definition. If 7 is a C*-algebra and a € W&, then (a) a is hermitian 
if a = a*; (b) a is normal if a*a = aa*; (c) a is unitary if a*a = aa* = 
(this only makes sense if . has an identity). 


1.11. Proposition. Let 2 be a C*-algebra and let a © &. 


(a) If a is invertible, then a* is invertible and (a*)~' = (a7')*. 

(b) a = x + iy where x and y are hermitian elements of 2. 

(c) If u is a unitary element of &, \\u|| = 1. 

(d) If @ is a C*-algebra and p: Y—> B is a *-homomorphism, then 
le(a)ll < |lall. 

(e) Ifa = a%*, then |\a|| = r(a). 
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PROOF. The proofs of (a), (b), and (c) are left as exercises. 

(e) Since a* = a, ||a?|| = ||a*a|| = ||a||*; by induction, |ja*"|| = |ja||*” for 
n> 1. That is, ||a?"||'7*" = |la|| for n > 1. Hence r(a) = lim|ja?"||'/7" = 
|||. 

(d) A *-homomorphism p: ~- # is an algebra homomorphism such 
that p(a)* = p(a*) for all a in &. If #W has an identity, it is not assumed 
that o(1) = the identity of &. However, it is easy to see that p(1) is the 
identity for clp(.#). If . does not have an identity, then p can be 
extended to a *-homomorphism p,: #~, — &, such that p,(1) = 1 (1.9). 
Thus it suffices to prove the proposition under the additional assumption 
that »& and @& have identities and p(1) = 1. 

If x € YM, then it follows that o(p({x)) C o(x) (Verify!) and hence 
r(p(x)) < r(x). So, using part (e) and the fact that a*a is hermitian, 
e(a)||? = ||p(a*a)|| = r(p(a*a)) < r(a*a) = |ja*al] = |lal|. 


1.12. Proposition. [f © is a C*-algebra and h: &— C is a homomorphism, 
then: 


(a) h(a) © R whenever a = a*; 

(b) h(a*) = h(a) for all a in 9; 

(c) h(a*a) = 0 for alla in ®&; 

(d) if 1 © PM and u is unitary, then |h(u)| = 1. 


PROOF. If has no identity, extend h to x, by letting 4(1) = 1. Thus, we 
may assume that . has an identity. By Exercise VII.8.1, ||h|| = 1. If 
a=a* andtER, 


|h(a + it)? < lla + it||* = |\(a + it)*(a + it)|| 
= ||(a — it)(a + it)| 
= |lja? + t?|| < |lal|* + ¢’. 
If h(a) = a+ if, a, B in R, then this yields 


lal? + 27> la + i(B+r)/° 


a’ +(B + 1)” 


a? + B* + 2Bt+ t’; 


hence |la||? > a? + B* + 28r for all t in R. If B # 0, then letting t > +00, 
depending on the sign of B, gives a contradiction. Therefore B = 0 or 
h(a) € R. This proves (a). 

Let a = x + iy, where x and y are hermitian. Since h(x), h(y) © R by 
(a) and a* = x — iy, (b) follows. Also, h(a*a) = h(a*)h(a) = |h(a)|? = 0, 


VIII.1. Elementary Properties and Examples 241 


so (c) holds. Finally, if w is unitary, |h(u)|? = h(u*)h(u) = h(u*u) = h(1) 
=]. a 


Note that part (b) of the preceding proposition implies that any homo- 
morphism h: »&-— C is a *-homomorphism. This, coupled with (VII.8.6), 
gives the following corollary. 


1.13. Corollary. [f o is an abelian C*-algebra and a is a hermitian element 
of WS, then o(a) CR. 


This corollary is short-lived as the conclusion remains valid even if . is 
not abelian. 


1.14. Proposition. Let < and @ be C*-algebras with identities such that 
ACB. Ifa =X, then o,(a) = og(a). 


PROOF. First assume that a is hermitian and let @ = C*(a), the C*-algebra 
generated by a and 1. So @ is abelian. By Corollary 1.13 o,(a) C R. By 
Theorem VII.5.4, o,(a) € og(a) = doy(a) C o,(a); so 0,(a) = o,(a) C 
IR. By similar reasoning, og(a) = o,(a), and hence o,(a) = og(a). 

Now let a be arbitrary. It suffices to show that if a is invertible in @, a is 
invertible in ./. So suppose there is a b in @ such that ab = ba = 1. Thus, 
(a*a)( bb*) = (bb*)(a*a) = 1. Since a*a is hermitian, the first part of the 
proof implies a*a is invertible in .. But inverses are unique, so bb* = 
(a*a)~' € &. Hence b = b(b*a*) = (bb*)a* EX. i 


This result must, of course, be contrasted with Theorem VII.5.4. 


EXERCISES 


1. Verify the statements made in Examples 1.2 through 1.6. 


2. Let W~= {fe C(clD): f is analytic in D} and for f in # define f* by 
f*(z) = f(z). Show that x is a Banach algebra, f* €.% when f €.%, and 
Lf * || = IL], but 7 is not a C*-algebra. 


3. If {.,: i © I} is a collection of C*-algebras, show that ©, wv, and ®,.%, are 
C*-algebras. 


4. Let X be a locally compact space and let YW be a C*-algebra. If C,(X, W%) = the 
collection of bounded continuous functions from X — &%, show that C,(X, #) is 
a C*-algebra. Let C)(X, #%) = all of the continuous functions f: X > # such 
that for every « > 0, {x © X: ||f(x)|| = e} is compact. Show that C)(X, #) is a 
C*-algebra. 
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§2. Abelian C*-Algebras and the Functional Calculus 
in C*-Algebras 


The next theorem is the basic result of this section. It will be used to develop 
a functional calculus for normal elements that extends the Riesz Functional 
Calculus. 


2.1. Theorem. Jf © is an abelian C*-algebra with identity and & is its 
maximal ideal space, then the Gelfand transform y: &— C() is an 
isometric *-isomorphism of & onto C(2). 


PrRooF. By Theorem VII.8.9, ||X||,, < ||x|| for every x in .. But ||X||,, 1s the 
spectral radius of x, so by (1.11e), ||x|| = ||*||,, for every hermitian element 
x of &. In particular, ||x*x|| = ||x*x||,, for every x in a. 

If aev and he, then 4*(h) = h(a*)=h(a) = a(h). That is, 
a* = 4. Equivalently, y(a*) = y(a)* since the involution on C() is de- 
fined by complex conjugation. Thus, y is a *-homomorphism. Also, la\|* = 


]a*al| = |la*al),, = |4["lla = llallz; therefore |Jal| = |J4l|,, and y is an 
isometry. 

Because y is an isometry, it has closed range. To show that y 1s surjective, 
therefore, it suffices to show that it has dense range. This is accomplished by 
applying the Stone—Weierstrass Theorem. Note that 1=1, so y(#) is a 
subalgebra of C(2) containing the constants. Because y preserves the 
involution, y(.%) is closed under complex conjugation. It remains to show 
that y(.%) separates the points of 2. But if A, and h, are distinct 
homomorphisms in 2, they are distinct because there is an a in &w such 
that h,(a) #h,(a). Hence a(h,) # a(h,). cal 


By combining the preceding theorem with Proposition 1.9 and Exercise 
VII.8.6, the following is obtained. 


2.2. Corollary. If & is an abelian C*-algebra without identity and & is its 
maximal ideal space, then the Gelfand transform y: W%— C,(2) is an 
isometric *-isomorphism of Y onto C,(2). 


In order to focus our attention on the key concepts and not be distracted 
by peripheral considerations, we now make the following. 


Assumption. A// C*-algebras that are considered have an identity. 


Let @ be an arbitrary C *-algebra and let a be a normal element of &. So 
if x= C*(a), the C*-algebra generated by a (and 1), © is abelian. Hence 
xf = C(), where & is the maximal ideal space of .. So by Theorem 2.1 if 
f © C(%), there is a unique element x of » such that X = f. We want to 
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think of x as f(a) and thus define a functional calculus for normal elements 
of a C*-algebra. To be useful, however, we should have a ready way of 
identifying 2. Moreover, since x = C*(a) and thus depends on a, it should 
be that 2 depends on a in a clear way. The idea embodied in Proposition 
VII.8.10 that 2 and o(a) are homeomorphic via a natural map is the key 
here, although (VII.8.10) is not directly applicable here since a is not a 
generator of C*(a) as a Banach algebra but only as a C*-algebra. [If 
a = a*, then a is a generator of C*(a) as a Banach algebra.] Nevertheless 
the result is true. 


2.3. Proposition. Jf is an abelian C*-algebra with maximal ideal space X 
and a © & such that Z= C*(a), then the map t: X — o(a) defined by 
T(h) = h(a) is a homeomorphism. If p(z, Z) is a polynomial in z and z and 
y: L—> C(2) is the Gelfand transform, then y( p(a, a*)) = per. 


The proof of this result follows, with a few variations, along the lines of 
the proof of Proposition VII.8.10 and is left to the reader. 

If +: & — o(a) is defined as in the preceding proposition, then 7*: 
C(o(a)) > C(2%) is defined by r*(f) = for. Note that +* is a *-isomor- 
phism and an isometry, because 7 is a homeomorphism. Note that <= 
C*(a) is the closure of { p(a,a*): p(z,Z) is a polynomial in z and 2}. 
Now such a polynomial p(z,Z) is, of course, a function on o(a). [Just 
evaluate the polynomial at any z in o(a).] The last part of (2.3) says that 
y( p(a, a*)) = 7*(p). We define a map p: C(o(a)) > C*(a) so that the 
following diagram commutes: 


C*(a) —— C2) 


2.4 ps J 
C(o(a)) 


Note that if @ is any C*-algebra and a is a normal element of &, then 
= C*(a) is an abelian C*-algebra contained in @ and so (2.4) applies. 
Moreover, in light of Proposition 1.4, the spectrum of a does not depend on 
whether a is considered as an element of ~ or &. The following definition 
is, therefore, unambiguous. 


2.5. Definition. If @ is a C*-algebra with identity and a is a normal 
element of @, let p: C(o(a)) — C*(a) C @ be as in (2.4). If f © C(a(a)), 
define 


f(a) = e(f). 
The map f > f(a) of C(o(a)) > @ is called the functional calculus for a. 
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Note that if p(z,z) is a polynomial in z and Zz, then p(p(z,Z))= 
p(a, a*). In particular, p(z"Z") = a”a*” so that p(z) = a and p(Z) = a™*. 
Also, p(1) = 1. 

The properties of this functional calculus can be obtained from the fact 
that p is an isometric *-isomorphism of C(o(a)) into @—with one excep- 
tion. How does this functional calculus compare with the Riesz Functional 
Calculus? If f € Hol(a), flo(a) € C(o(a)); so f(a) has two possible inter- 
pretations. Or does it? 


2.6. Theorem. If @ is a C*-algebra and a is a normal element of &, then 
the functional calculus has the following properties. 


(a) f > f(a) is a *-monomorphism. 


(b) IIf(@)Il = Ill... 


(c) f > f(a) is an extension of the Riesz Functional Calculus. 


Moreover, the functional calculus is unique in the sense that if t: C(o(a)) 
— C*(a) is a *-homomorphism that extends the Riesz Functional Calculus, 


then t(f) = f(a) for every f in C(0(a)). 


PROOF. Let p: C(o(a)) ~ C*(a) be the map defined by p( /) = f(a). From 
(2.4), (a) and (b) are immediate. 

Let 7: Hol(a) ~ “~C # denote the map defined by the Riesz Functional 
Calculus. Since p(z) = 7(z)= a, an algebraic manipulation gives that 
e(f) = a(f) for every rational function f with poles off o(a). If f <= Hol(a), 
then by Runge’s Theorem there is a sequence {/f,} of such rational 
functions such that f,(z) > f(z) uniformly in a neighborhood of o(a). 
Thus 7(f,) > 7(f). By (b), o(f,) > p(f). Thus p(f) = 7(f). 

To prove uniqueness, let +: C(o(a)) ~ & be a *-homomorphism that 
extends the Riesz Functional Calculus. If f € C(o(a)), then there is a 
sequence { p,} of polynomials in z and Zz such that p,(z,z) > f(z) 
uniformly on o(a). But t( p,,) = p,(a, a*), t(p,) > T(f), and p,(a,a*) > 
f(a). Hence r(f) = f(a). a 


Because of the uniqueness statement in the preceding theorem, it is not 
necessary to remember the form of the functional calculus f+> f(a), but 
only the fact that it is an isometric *-monomorphism that extends the Riesz 
Functional Calculus. Indeed, by the uniqueness of the Riesz Functional 
Calculus, it suffices to have that f > f(a) is an isometric *-monomorphism 
such that if f(z) = 1, then f(a) = 1, and if f(z) = z, then f(a) = a. Any 
properties or applications of the functional calculus can be derived or 
justified using only these properties. There may, however, be an occasion 
when the precise form of the functional calculus [viz., (2.4)] facilitates a 
proof. There are also situations in which the definition of the functional 
calculus gets in the way of a proof and the properties in (2.6) give the clean 
way of applying this powerful tool. 
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2.7. Spectral Mapping Theorem. If ./ is a C*-algebra and a is a normal 
element of , then for every f in C(o(a)), 


o(f(a)) = f(o(a)). 


Proor. Let p: C(o(a)) > C*(a) be defined by p(f) = f(a). So p is a 


*-isomorphism. Hence o(/f(a)) = o(p(f)) = o(f). But o(f) = f(o(a)) 
(VII.3.2). a 


Once again (1.14) was used implicitly in the preceding proof. 


EXERCISES 


1. Prove a converse to Proposition 2.3. If K is a compact subset of C, C(K) is a 
singly generated C*-algebra. 


2. If x is an abelian C*-algebra with a finite number of C*-generators a,,...,a,,, 
then there is a compact subset X of C” and an isometric *-isomorphism p: 
xf — C(X) such that p(a,)=z,, 1 <k <n, where z,(Aj,...,A,) =A, (see 
Exercise VII.8.12). 


3. A Stonean space is a compact space X such that the closure of every open subset 
of X is open. (a) Show that the Cantor set is a Stonean space. (b) Show that a 
compact space X is a Stonean space if and only if each connected subset of X is 
a singleton set. (c) Show that X is a Stonean space if and only if C(X) is the 
closed linear span of its projections (= hermitian idempotents). 


4. Using the terminology of Exercise 3, show that if (X, 2,1) is a o-finite measure 
space, the maximal ideal space of L®(X, 2, w) is a Stonean space. 


5. If W isa C*-algebra with identity and a = a*, show that exp(ia) = u is unitary. 
Is the converse true? 


6. Let X be compact and fix a point x) in X. Let W={{f,}: f, © CCX), 
sup,,||f,,|| < oo, and {f,(x,))} 1s a convergent sequence}. Show that is an 
abelian C*-algebra with identity and find its maximal ideal space. 


7. If X is completely regular, then C,(X) is a C*-algebra and its maximal ideal 
space is the Stone—Cech compactification of X. 


§3. The Positive Elements in a C*-Algebra 


This section is an application of the functional calculus developed in the 
preceding section. The results here are very useful in the study of operators 
on a Hilbert space and they demonstrate the power of the functional 
calculus. 

If Y is a C*-algebra, let Re. denote the hermitian elements of 7. 
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3.1. Definition. If © is a C*-algebra and a © a, then a 1s positive if 
a © Re and o(a) C (0, 00). If a is positive, this is denoted by a > 0. Let 
SW, be the set of all positive elements of 7. 


3.2. Example. If = C(X), then f is positive in & if and only if 
f(x) = 0 for all x in X. 


3.3. Example. If “= L®(p) and fe L*(p), then f > 0 if and only if 
f(x) => 0 ae. [p]. 


3.4. Proposition. Jf a € Re, then there are unique positive elements u,v 
in SS such that a = u — v and uv = vu = 0. 


Proor. Let f(t) = max(t,0), g(t) = —min(t,0). Then f,g <€ C(IR) and 
f(t) — g(t) = t. Using the functional calculus, let u = f(a) and v = g(a). 
So uw and v are hermitian and by the Spectral Mapping Theorem u,v > 0. 
Also, u — v = f(a) — g(a) = a and w = vu = (gf )(a) = 0 since fg = 0. 

To show uniqueness, let u,,v, € %, such that u, — v,; =a and u,v, = 
vu, = 0. Let { p,,} be a sequence of polynomials such that p,(0) = 0 for all 
n and p(t) — f(t) uniformly on o(a). Hence p,(a)—>u in x. But 
u,a = au,. So u,p,(a) = p,(a)u, for all n; hence u,u = uu,. Similarly, it 
follows that a, u, v, u,, and v, are pairwise commuting hermitian elements 
of Y%. Let @= the C*-algebra generated by a, u, v, u,, and v,; so 1s 
abelian. Hence 4 = C(2) where ~ is the maximal ideal space of @. The 
uniqueness now follows from the uniqueness statement for C(2) (Exercise 
1). a 


The next result follows in a similar way. 


3.5. Proposition. Jf a ©, and n > 1, there is a unique element b in &, 
such that a = b". 


The decomposition a = u—v of a hermitian element a is sometimes 
called the orthogonal decomposition of a. The elements u and vu are called 
the positive and negative parts of a and are denoted by u =a, andv=a_. 
Note that a_> 0. 

If a € #,, then the unique b obtained in (3.5) is called the nth root of a 
and is denoted by b = a!’". Note that if b is not assumed to be positive, it 
is not necessarily unique (see Exercise 5). 

If X is compact and f€ C(X),, then notice that |f(x)—¢| <7¢ for 
every real number ¢ > ||/||. Conversely, if |f(x) — ¢| < ¢ for some ¢ > ||/lI, 
then f(x) = 0 for all x and so f= 0. These observations are behind some 
of the statements in the next result. 


3.6. Theorem. [f x is a C*-algebra and a © &, then the following state- 
ments are equivalent. 


(a) a= 0. 
(b) a = b? for some b in Re. 
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(c) a = x*x for some x in &. 
(d) a = a® and \|t — a\| < t for allt > |lalj. 
(e) a = a* and ||t — al| < t for some t > |lall. 


PROOF. It is clear that (b) implies (c) and (d) implies (e). By (3.5), (a) implies 
(b). 

(e) = (a): Since a = a*, C*(a) is abelian. If X = o(a), X CR and 
f > f(a) is a *-isomorphism of C(X) onto C*(a). Using this isomorphism 
and (e), |¢ — x| <7¢ for some ¢ > |la|| = sup{|s|: s © X} and all x in X. 
From the discussions preceding this theorem (with f(x) = x), x = 0 for all 
x in X. That is, X = o(a) C [0, co). Hence a => 0. 

(a) = (d): This proof follows the lines of the preceding paragraph and is 
left to the reader. 

(c) = (a): If a = x*x for some x in W&%, then it is clear that a = a*. Let 
a = u — v, where u,v > O and wy = vu = 0. It must be shown that v = 0. 

If xv'/* = b + ic, where b,c € Rew, then (xu'/*)*(xu'/*) = (b — ic) 
(b + ic) = b* + c? + i(bc — cb). But also (xu'/*)*(xv'/7) = u'/?x *xy'/? = 
v'/?(u — v)v'/? = —v*. So the uniqueness of the Cartesian decomposition 
implies that b? + c? = —v* and bc = cb. Thus &, the C*-algebra gener- 
ated by b and c, is abelian. Hence if A € o(b? + c”), there is a homomor- 
phism h: @—>C such that A = h(b? + c?) = h(b)* + h(c)*. Since 
h(b),h(c) € R (1.12), 4 > 0. Thus 6b? + c? €.¥,. But —(b* + c*) =v" 
and the same type of argument shows that v* € .,. Thus v* © .&, N(—W#, ). 
By Proposition 3.7 below, v? = 0. Since v > 0, v = 0(3.5). 


The next result will be proved only using the equivalence of (a), (d), and 
(ec) from the preceding theorem. 


3.7. Proposition. If .< is a C*-algebra, then , is a closed cone. 


PROOF. Let {a,,} C .@, and suppose a, — a. Clearly a © Rew. By (3.6d), 
lla, — ll@,|lll < |l@,||. Hence ||a — ||a|||{ < lal], so by (3.6e), a = 0. 

Clearly, Aa > O if a>O0 and AO. Let a,b & H,; it must be shown 

that a + b = O. It suffices to assume that ||a]|, ||b|| < 1. But ||] — 3(a + b)|| 
= 41 — a) + (1 — b)|| < 1 by (3.6d). So by (3.6e), 5(a + b) > 0. 

Ifa € %,1(—-#,), then a = a* and o(a) = {0}. But ||a|| = r(a) (1.11e). 

| 


Now to look at one more example—a very important one. 


3.8. Theorem. /f # is a Hilbert space and A © B(#), then A => 0 if and 
only if (Ah, h) = 0 for allh in #. 


ProoF. If A => 0, then (3.6c) A = 7*T for some T in 4(#). Hence 
(Ah, h) = ||Th||? => 0. Conversely, suppose (Ah, h) > 0 for all A in #%. By 
(1.2.12), A = A*. It remains to show that o(A) C [0, 00). If h © # and 
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A < 0, then 
(A — AJA? = [AAI]? — 2ACAA, AY + HAI? 
> —2A(Ah,h) + dIIJAll? = AXIAL? 


since A <0 and (Ah,h) = 0. By (VII6.4), A € 6,,(A). But this implies 
that A — A is left invertible (Exercise VII.6.5). Since A — A 1s self-adjoint, 
A — X is also right invertible. Thus A ¢ o( A) and A = 0. zi 


3.9. Definition. If . is a C*-algebra and a,b € Rew, then a < b if 
b-acTZX,. 


This ordering makes a C*-algebra into a partially ordered vector space 
(over C). 

Note that if A and B are hermitian operators on the Hilbert space #, 
then A < B if and only if (Ah,h) < (Bh,h) for all h in #. 

This section closes with an application of positivity to obtain the polar 
decomposition of an operator. If A € C, then A = |Ale’® for some 9; this is 
the polar decomposition of 4. Can an analogy be found for operators? To 
answer this question we might first ask what is the analogy of |A| and e? 
among operators. If A € @(#), then the proper definition for |A| would 
seem to be |A| = (A*A)!/? [see (3.5)]. How about an analogy of e’’? Should 
it be a unitary operator? An isometry? For an arbitrary operator neither of 
these is correct. The following new class of operators is needed. 


3.10. Definition. A partial isometry is an operator W such that for A in 
(ker W)+, ||Wh|| = ||A||. The space (ker W)~ is called the initial space of 
W and the space ran W is called the final space of W. See Exercises 15-20 
for more on partial isometries. 


3.11. Polar Decomposition. Jf A <€ @(#), then there is a partial isometry W 
with (ker A)* as its initial space and cl(ran A) as its final space such that 
A = W\A|. Moreover, if A = UP where P > 0 and U is a partial isometry 
with initial and final spaces (kerA)~ and cl\(ran A), respectively, then 
P = |A| and U = W. 


Proor. If h € #, then ||Ah||* = (Ah, Ah) = (A*Ah,h) = (AJA, |AA). 
Thus 
3.12 AA||? = || LAIAII?. 


Since (ran A*)+ = ker A, ran A* is dense in (kerA)*. If fe ran A*, f= 
A*g for g in (ker A*)*= clran A. Therefore, { A*Ak: k © #} is dense in 
cl{ran A*] = (ker A)+. But A*Ak = |A|?k = |A|h, where h = |A|k. That is, 
{|A|h: h © 3} is dense in (ker A)*. If W: ran|A| > ran A is defined by 


3.13 W(\A|h) = Ah, 
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then (3.12) implies that W is a well-defined isometry. Thus W extends to an 
isometry W: (ker A) ~ — cl(ran A). If Wh = 0 for h in ker A, W is a partial 
isometry. By (3.13), W|A| = A. 

For the uniqueness, note that A*A = PU*UP. Now U*U = E = the 
projection onto the initial space of U (Exercise 16), (ker A)*+. But ker A D 
ker P, so (ker A)~ C (ker P)*+ = cl(ran P), since P = P*. Hence EP = P. 
Thus A*A = PEP = P*. By the uniqueness of the positive square root, 
P = |A|. Since A = U|A|, U|A|h = Ah = W\A|h. That is, U and W agree 
on a dense subset of their common initial space. Hence U = W. | 


EXERCISES 


1. Prove the uniqueness statement in Proposition 3.4 for the case that Ww is 
abelian. 


2. Prove Proposition 3.5. 


3. Let A € B(L?(0,1)) be defined by (Af)(t) = tf(t). Show that A > 0 and find 
Asn. 


4. Let (X, 2, w) be a o-finite measure space, let ¢ € L*(X, 0, u), and define M, 
as in Theorem II.1.5. Show that M, > 0 if and only if ¢(x) > 0 ae. [wu]. What 
is Mj/"? If M, € Re @(#), find the positive and negative parts of M,. 


5. Find an example of a positive operator on a Hilbert space that has a nonhermi- 
tian square root. 


. If a € Rew&, show that jal = (a*)'”* =a,4+a4_. 
. If a€ &,, show that x*ax € YW, for every x in #. 


_ If a,b € M,0 < a < b,and a is invertible, then b is invertible and b=! < a™!. 


oO CO NS WH 


. If a,b © ReA, a<b, and ab= ba, then f(a) < f(b) for every increasing 
continuous function f on R. 


10. If a € Re# and |la|| < 1, show that a is the sum of two unitaries. (Hint: First 
solve this for = C.) 


11. If a> 0, define f.: (-—a”!,o) >R by f£,(t)=t/1 + at)=a ffl -( + 
at)~']. Show: 
(a) Ifa< bin &, f(a) < f,(d) for all a > 0; 
(b) f,(t) < min{t,a~'} for t > 0; 
(c) lim, .of,(¢) = ¢ uniformly on bounded intervals in [0, oo); 
(d) ifa<B, f. <fp; 
(e) fy ° fe = tadap 


(f) hm,_.,,.af,(¢) = 1 uniformly on bounded intervals in [0, 00). 


12. If a,b € %, and a < J, show that a® < b8 for0 < B < 1. (a® = f(a) where 
f(t) = 8.) (Hint: Let f, be as in Exercise 11 and show that [2f,(t)a-*’da = 
yt® where y > 0. Use the definition of the improper integral and the functional 
calculus.) 
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13. 


14. 


15. 


16. 


17. 


18. 


19. 


20. 


21, 


22, 


23. 


24. 
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Give an example of a C*-algebra W and positive elements a,b in # such that 
a<bbutb?-aE€éwy,. 


Let “= &(I*), let a = the unilateral shift on /?, and let b = a*. Show that 
o(ab) # o(ba). 


Let W € @(#) and show that the following statements are equivalent: (a) W is 
a partial isometry; (b) W* is a partial isometry; (c) W*W is a projection; (d) 
WW* is a projection; (ec) WW*W = W; (f) W*WW* = wW*. 


If W is a partial isometry, show that W*W is the projection onto the initial 
space of W and WW* is the projection onto the final space of W. 


If W,, W, are partial isometries, define W, < W, to mean that W*W, < Wy*W,, 
WW < W,W +, and W,h = W,h whenever h is in the initial space of W,. 
Show that < is a partial ordering on the set of partial isometries and that a 
partial isometry W is a maximal element in this ordering if and only if either W 
or W* is an isometry. 


Using the terminology of Exercise 17, show that the extreme points of ball 4(#) 
are the maximal partial isometries. 


Find the polar decomposition of each of the following operators: (a) M, as 
defined in (II.1.5); (b) the unilateral shift; (c) the weighted unilateral shift 
[A(x,,%5,...) = (0, a,x), @) x2, ...) for x in /? and sup,|a,,| < 00] with non- 
zero weights; (d) A @ a (in terms of the polar decomposition of A). 


Let A € @(#) such that kerA = (0) and A > 0 and define S on #= #0 # 
®--- by S(h,,h,, ...) = (0, Ah,, Ah,, ...). Find the polar decomposition of 
S, S = W\S|, and show that S = |S|W. 


Show that the parts of the polar decomposition of a normal operator commute. 


If A € @(#), show that there is a positive operator P and a partial isometry 
W such that A = PW. Discuss the uniqueness of P and W. 


If A is normal and o(A)M { re!®: r>Oanda<06@< B} =O, where0 < B-a 
< 2a, show that the parts of the polar decomposition of A belong to C*(A). 


Give an example of a normal operator A such that the partial isometry in the 
polar decomposition of A does not belong to C*(A). 


§4*. Ideals and Quotients for C*-Algebras 


We begin with a basic result. 


4.1. 


Proposition. If I is a closed left or right ideal in the C*-algebra &, 


a © I with a = a*, and if f € C(o(a)) with f(0) = 0, then f(a) € I. 


ProoF. Note that if J is proper, then 0 € o(a) since a cannot be invertible. 
Since o(a) C R, the Weierstrass Theorem implies there is a sequence { p, } 
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of polynomials such that p,(t) > f(t) uniformly for ¢ in o(a). Hence 
P, (9) — f(0) = 0. Thus q,(t) = p,(t) — p,(0) ~ f(t) uniformly on o(a) and 
q, (0) = 0 for all n. Thus q,(a) € I and by the functional calculus, ||q,,(a) 
— f(a)|| - 0. Hence f(a) € I. B 


4.2. Corollary. If I is a closed left or right ideal, a € I with a = a*, then 
a,, a_, |a|, and |a|'? ET. 


Note that if J is a left ideal of 7, then {a*: a © J} is a right ideal. 
Therefore a left ideal J is an ideal if a* € J whenever a &€ I. 


4.3. Theorem. Jf J is a closed ideal in the C*-algebra , then a* © I 
whenever a € I. 


PROOF. Fix a in J. Thus a*a € J since J is an ideal. The idea is to construct 
a sequence {u,,} of continuous functions defined on [0, oo) such that 


aa (i) u,(0) = Oand u,(t) > 0 for all ; 
(ii) lau,(a*a) — al|| > Oasn > co. 
Note that if such a sequence {u,} can be constructed, then u,(a*a) > 0 
and u,(a*a) € I by Proposition 4.1. Also, u,(a*a)a* € I since J is an 
ideal and ||u,(a*a)a* — a*|| = ||au,(a*a) — a|| > 0 by (ii). Thus a* ET 
whenever a © J. It remains to construct the sequence {u, }. 
Note that 


\|au,,(a*a) — al]? 
= \| | au, (a*a) ~- a|*[au,(a*a) —al| 
= ||u,,(a*a)a*au,(a*a) — a*au,(a*a) — u,(a*a)a*a + a*al|. 


If b = a*a, then the fact that bu,(b) = u,(b)b implies that ||au,(a*a) — 
a\|? = |If,(b)I| < sup{If,(t)|: t= 0}, where f,() = 1m,(t)? — 2tu,(t) +1 
= t[u,(t) — 1]. If u,(t) = ant for0<t<n7! and u(t)=1 for t>n7}, 
then it is seen that sup{|f,(7)|: ¢ => 0} = 4/27n > 0 as n > ©; so (4.4) is 
satisfied. x 


Notice that the construction of the sequence { u,,} satisfying (4.4) actually 
proves more. It shows that there is a “local” approximate identity. That is, 
the proof of the preceding theorem shows that the following holds. 


4.5. Proposition. If < is a C*-algebra and I is an ideal of x, then for every 
ain I there is a sequence {e,,} of positive elements in I such that: 
(a) e, <e, < ++: and |le,|| <1 forall n; 


(b) |lae, — al| ~ 0 asn— o. 


In the preceding proposition the sequence {e,,} depends on the element 
{a}. It is also true that there is a positive increasing net {e,} in J such that 
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lle,a — al|| — 0 and ||ae, — a|| > O for every a in I (see p. 36 of Arveson 
[1976)}). 
We turn now to an important consequence of Theorem 4.3. 


4.6. Theorem. If / is a C*-algebra and I is a closed ideal of , then for 
each a+I1 in @/I define (a+ 1)* =a* +I. Then /I with its quotient 
norm is a C*-algebra. 


To prove (4.6), a lemma is needed. 


4.7. Lemma. If I is an ideal in a C*-algebra % and a © &, then |\ja + I] 
= inf{||a — ax||: x ET, x = 0, and ||x|| < 1}. 


ProoF. If (ball 7), = {x € ball J: x > 0}, then clearly |ja + J|| < inf{||a — 
ax||: x & (ball), } since al Cc I. Let y € J and let {e,,} be a sequence in 
(ball 7), such that |ly — ye,|| ~ 0 as n> co. Now 0<1-—e, <1, so 
l(a + yA — e,)I| S lla + yl]. Hence 


a + y|| > liminfl|(a + y)(1 — e, )I 
= liminf||(a — ae,) +(y — ye, )|| 
= liminf||a — ae,]|| 


since ||y — ye,|| — 0. Thus ||a + y|| > inf,||a — ae,|| > inf{|la — ax||: x © 
(ball 7), }. Taking the infimum over all y in J gives the desired remaining 
inequality. = 


PROOF OF THEOREM 4.6. The only difficult part of this proof is to show that 
ja + I||? = |Ja*a + I|| for every a in &. If x © J, then 
la + xl|? = (a + x) "(a + x)I 
= |la*a + a*x + x*a + x*x|| > inf{|ja*a + yl]: y ET} 
= |la*a + I|| 


since a*x + x*a + x*x © I whenever x € J (4.3). On the other hand, the 
preceding lemma gives that 


la + 1||* = inf{||a — ax||*: x € (ball 7), } 

inf{||a(1 — x)||?: x € (ball 7), } 

inf{\|(1 — x)a*a(1 — x)|: x © (ball 7), } 
inf{||a*a(1 — x)|]: x © (ball 7), } 

inf {||a*a — a*ax||: x © (ball J), } 

lla*a + I||. rt 


IA 


If ., @ are C*-algebras with ideals J, J, respectively, and 9: W> # 1s 
a *-homomorphism such that p(/) CJ, then p induces a *-homomor- 
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phism p: W/I ~ B/J defined by p(a + J) = p(a) +J. In particular, if 
I =kerp, then p: &/kerp ~ # is a *-homomorphism and p°7 = p, 
where 7: &%-—> &/kerp is the natural map. Keep these facts in mind when 
reading the proof of the next result. 


4.8. Theorem. Jf <,& are C*-algebras and p: Y—> B is a *-homomor- 
phism, then |\p(a)|| < |lal| for all a and ranp is closed in @. If p is a 
*-monomorphism, then p is an isometry. 


PROOF. The fact that ||p(a)|| < ||a|| is a restatement of (1.11d). Now assume 
that p is a *-monomorphism. As in the proof of (1.11d), it suffices to 
assume that »w and & have identities and p(1) = 1. (Why?) 

If a& VM and a=a%, then it is easy to see that p(a) = p(a)* and 
o(p(a)) C o(a). If o(p(a)) # o(a), there is a continuous function f on 
o(a) such that f(t) = 0 for all ¢ in o( p(a)) but f is not identically zero on 
o(a). Thus f(p(a)) = 0, but f(a) # 0. Let { p,} be polynomials such that 
P,(t) > f(t) uniformly on o(a). Thus p,(a)— f(a) and p,(p(a)) > 
f(p(a)) = 0. But p,(p(a)) = p( p,(a)) > p(f(a)). Thus p( f(a)) = f(p(a)) 
= 0. Since p was assumed injective, f(a) = 0, a contradiction. Hence 
o(a) = o(p(a)) if a = a*. Thus by (1.11e), |lal| = r(a) = r(e(a)) = ||p(a)]| 
if a= a*. But then for arbitrary a, |jajj* = |ja*a\j = ||jp(a*a)|| = 
1p(a)*p(a)|| = ||o(a)||? and p is an isometry. 

To complete the proof let p: s— @ be a *-homomorphism and let p: 
W/kerp — & be the induced *-monomorphism. So p is an isometry and 
hence ran p 1s closed. But ran 6 = ran op. a 


We turn now to some specific examples of C*-algebras and their ideals. 


4.9. Proposition. [f X is compact and I is a closed ideal of C( X), then there 
is a closed subset F of X such that I = { f © C(X): f(x) = 0 for all x in F }. 
Moreover, C(X)/I is isometrically isomorphic to C(F ). 


PRooFr. Let F = {x © X: f(x) = 0 for all f in J}, so F is a closed subset 
of x. If we M(X) and p J, then /{|f|?du = 0 for every f in J since 
\f|? = ff © I whenever f © J. Thus each f must vanish on the support of p:; 
hence |u|(X \ F) = 0. Conversely, if p € M(X) and the support of p is 
contained in F, {fdu=0 for every f in J. Thus 1*= {hp © M(X): 
lu|(X \ F) = 0}. Since J is closed, J =*(1~) = {f€ CCX): f(x) = 0 for 
all x in F'}. The remainder of the proof is left to the reader. a 


4.10. Proposition. Jf J is a closed ideal of B(#), then ID B(#) or 
I = (0). 


PROOF. Suppose J # (QO) and let J be a nonzero operator in J. Thus there 
are vectors fp, f, in # such that Tf) =f, #0. Let go, 2, be arbitrary 
nonzero vectors in #. Define A: #—> # by letting Ah = ||gol| 7h, 25) fo. 
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Then Ag, =f, and Ah=0 if h 1g. Define B: #->H# by letting 
Bh = |\f, ll 7<4, f.) 8). So BA, = g, and Bh = 0 if h 1 f,;. Thus BTAh = 0 
if h 1 g, and BTAg, = g,. Hence for any pair of nonzero vectors gy, g, in 
H# the rank-one operator that takes g, to g, and is zero on [g,]~ belongs 
to J. From here it easily follows that J contains all finite-rank operators. 
Since J is closed, J D B)(#). a 


It will be shown in (IX.4.2), after we have the spectral theorem, that if J is 
a closed ideal in @(#) and # is separable, then J = (0), @,(#), or 
BH). 


EXERCISES 
1. Complete the proof of Proposition 4.9. 
2. Show that M,(C) has no nontrivial ideals. Find all of the left ideals. 


3. If @ is an infinite cardinal number, let J = {A © &@(#): dimcl(ran A) < a}. 
Show that J, is a closed ideal in #(%). 


4. Let S be the unilateral shift on /*. Show that C*(S)D &,(/*) and 
C*(S)/@,(I’) is abelian. Show that the maximal ideal space of C*(S)/&,(/) 
is homeomorphic to dD. 


5. If V is the Volterra operator on L*(0,1), show that C*(V) = C + &@,(L7(0,1)). 


6. If WM is a C*-algebra, J is a closed ideal of .&%, and @ is a C*-subalgebra of #, 
show that the C*-algebra generated by ]U # is 1+ &. 


7. If M is a C*-algebra and J and J are closed ideals in &, show that /+ J isa 
closed ideal of 2. 


§5*. Representations of C*-Algebras and the 
Gelfand—Naimark~—Segal Construction 


5.1. Definition. A representation of a C*-algebra is a pair (7, # ), where 
is a Hilbert space and 7: ~—> B(#) is a *-homomorphism. If © has an 

identity, it is assumed that (1) = 1. (The algebras considered in this book 

are assumed to have an identity. This proviso is given for the reader’s 

convenience when consulting the literature.) Often #7 is deleted and we say 

that 7 is a representation. 


5.2. Example. If # is a Hilbert space and ww is a C*-subalgebra of 
B(#), then the inclusion map 7 > &(#) is a representation. 


5.3. Example. If 1 is any cardinal number and # is a Hilbert space, let 
HH") denote the direct sum of #% with itself n times. If A € #(#), then 
A is the direct sum of A with itself n times; so A € B(.%'"’) and 
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\|A(|| = ||A||. The operator A‘ is called the inflation of A. If 7: ’%-> 
B(#H) is a representation, the inflation of 7 is the map 1: SP B( KH”) 
defined by 7°"(a) = 2(a)™ for all a in &. 


5.4. Example. If (X, 2, 1) is a o-finite measure space and #= L*(p), then 
am: L™() > &B( #) defined by 7($) = M, is a representation. 


5.5. Example. If X 1s a compact space and p is a positive Borel measure 
on X, then 7: C(X) > &(L*(p)) defined by 7(f) = M, is a representa- 
tion. 


5.6. Definition. A representation 7 of a C*-algebra © is cyclic if there is a 
vector e in # such that cl[7(.<)e] = #; e is said to be a cyclic vector for 
the representation 7. 


Note that the representations in Examples 5.4 and 5.5 are cyclic (Ex- 
ercises 2 and 3). Also, the identity representation i: @(#) - B(#) 1s 
cyclic and every nonzero vector is a cyclic vector for this representation. If 
L= C+ B(#), then the identity representation is cyclic. On the other 
hand, if n > 2, then the inflation 7°”) of a representation of C(X) is never 
cyclic (Exercise 4). 

There is another way to obtain representations. 


5.7. Definition. If {(7,, 7%): i © I} is a family of representations of , 
then the direct sum of this family is the representation (7, #), where 
H= 0,4, and m(a) = {7,(a)} for every a in &. 


Note that since ||7,(a)|| < ||a|| for every i (4.8), z(a) is a bounded 
operator on #%. It is easy to check that 7 is a representation. 


5.8. Example. Let X be a compact space and let {u,} be a sequence of 
measures on X. For each n let 7: C(X)— &(L*(p,)) be defined by 
7,(f) = M, on L*(u,,). Then 7 = © 7, is a representation. If the measures 
{p, } are pairwise mutually singular, then 7 is equivalent (below) to the 
representation f > M, of C(X) > B(L*(p)), where wp = Lp, /2” ||, 
(Exercise 5). 


The concept of equivalence for representations is that of unitary equiv- 
alence. That is, two representations of a C*-algebra ©, (7,,%,) and 
(7, #,), are equivalent if there is an isomorphism U: #, > #, such that 
Un,(a)U~* = 7,(a) for every a in x. The importance of cyclic representa- 
tions arises from the fact, given in the next result, that every representation 
is equivalent to the direct sum of cyclic representations. 


5.9. Theorem. Jf 7 is a representation of the C*-algebra &, then there is a 
family of cyclic representations {1,} of & such that m and ©,7, are 
equivalent. 


256 VII. C*-Algebras 


Proor. Let &= the collection of all subsets E of nonzero vectors in # 
such that 7(.W%)e 1 7(@)f for e, f in E withe # f. Order & by inclusion. 

An application of Zorn’s Lemma implies that & has a maximal element £5. 

Let 4 = @({cll7(H)e]: e& Ey}. If he HO H, then 0 = (r(a)e,h) 

for every a in #@ and e in E,. So if a,bE& XM and ec Ey, 0= 

(a(b*a)e,h) = (m(b)*r(a)e,h) = (r(aje,a(b)h). That is, m(%)et 

a(&)h for all e in E,. Hence E, U {h} € @, contradicting the maximality 
of E,. Therefore #”7= %. 

For e in E, let # = cl[7(#)e]. If a € &, clearly (a) H, C H,. Since 
a* € WM and m(a)* = 7(a*), H reduces x(a). So if 7: PJ BH) is 
defined by 7,(a) = 7(a)|#, 7, is a representation of a. Clearly 7 = © {7,: 
Ce Ee). B 


In light of the preceding theorem, it becomes important to understand 
cyclic representations. To do this, let 7: ~%— B(#) be a cyclic representa- 
tion with cyclic vector e. Define f: “%— C by f(a) = (m(a)e,e). Note that 
f is a bounded linear functional on # with ||f|| < |lel||?. Since f(1) = |lell’, 
[fl] = lel’. Moreover, f(a*a) = (a(a*a)e,e) = (m(a)*r(a)e,e) = 
I7(a)el|? > 0. 


5.10. Definition. If ./ is a C*-algebra, a linear functional f: “~%— C is 
positive if f(a) = 0 whenever a € Y,. A state on & 1s a positive linear 
functional on of norm 1. 


5.11. Proposition. If f is a positive linear functional on a C*-algebra &, 
then 


f(y *x)? < f(y *y) f(x*x) 
for every x, y in &. 


ProoF. If [x, y] = f(y*x) for x, y in &, then [-, -] is a semi-inner product 
on ». The proposition now follows by the CBS inequality (1.1.4). ie 


5.12. Corollary. If f is a positive linear functional on the C *-algebra , then 
f is bounded and ||f|| = f(). 


5.13. Example. If X is a compact space, then the positive linear functionals 
on C(X) correspond to the positive measures on X. The states correspond 
to the probability measures on X. 


As was shown above, each cyclic representation gives rise to a positive 
linear functional. It turns out that each positive linear functional gives rise 
to a cyclic representation. 
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5.14. Gelfand—Naimark—Segal Construction. Let < be a C*-algebra with 
identity. 


(a) If f is a positive linear functional on x, then there is a cyclic representa- 
tion (1, #,) of L with cyclic vector e such that f(a) = (a,(a)e,e) for 
alla in &. 

(b) If (7, #) is a cyclic representation of s with cyclic vector e and 
f(a) = (a(a)je,e) and if (1,, #;) is constructed as in (a), then a and Ty 
are equivalent. 


Before beginning the proof, it will be helpful if the theorem is examined 
when & is abelian. So let x= C(X) where X is compact. If f is a positive 
linear functional on .o%, then there is a positive measure p on X such that 
f(¢) = fodp for all @ in &. The representation (7,, #,) is the one 
obtained by letting #, = L*(u) and 7,(¢) = M,, but let us look a little 
closer. One way to obtain L*(p) from C(X) and p is to let Y= {6 € C(X): 
{\o|* du = 0}. Note that # is an ideal in C(X). Define an inner product on 
C(X)//L by (6+ LW +L) = {ow du. The completion of C(X)/¥ with 
respect to this inner product is L7(,1). 

To see part (b) in the abelian case, let 7: C(X) ~ @(#) be a cyclic 
representation with cyclic vector e. Let p be the positive measure on X such 
that /odu = (2(o)e,e) = f(¢). Now define U,: C(X) ~ # by U,(¢) = 
m()e. Note that U, is linear and has dense range. If & is as in the 
preceding paragraph and ¢¢€¥Y, then ||U,(¢)||* = (7(o)e, 7(d)e) = 
(1(o*p)e,e) = {\o|? du = 0. So U,.L= 0. Thus U, induces a linear map 
U: C(X)/L-> # where U(o+H)=ar(o)e. If (6+ L%,~t+H) = 
fob du, then (U(p + LZ), UY + L)) = (a(g)e, m(W)e) = (1(OY*)e, e) 
= fovdp = (¢+ #,y + &). Thus U extends to an isomorphism U from 
the completion of &/L= L*(h) onto #. So U: L*(p) > # and if 
@ © C(X) and we think of C(X) as a (dense) subset of L?(.), Ud = 7(o)e. 
If ¢,y Ee C(X), then UM,y = U(oy) = a(ovje = m(o)m(y)e = 
m(p)U(y); that is, UM, = 7($)U on a dense subset of L*(y) and, hence, 
UM, = 7(¢)U for every ¢ in C(X). In other words, 7 is equivalent to the 
representation @ > Mg. 


PROOF OF THEOREM 5.14. Let f be a positive linear functional on » and 
put Y= {x © A: f(x*x) = 0}. It is easy to see that Y is closed in &. 
Also if a &© & and x € ¥, then (5.11) implies that 


f((ax)*(ax))" = f(x*(a%ax))’ 
< f(x*x) f(x*a*aa*ax ) 
= (). 


That is, # is a closed left ideal in ». Now consider #~/Y as a vector 
space. (Since ¥ is only a left ideal, /¥ is not an algebra.) For x, y in &%, 
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define 
(x+L,y+L) =f(y*x). 


It is left as an exercise for the reader to show that ¢ - ,-) is a well-defined 
inner product on ~/%. Let #; be the completion of ~/Y% with respect to 
the norm defined on &#/¥ by this inner product. 

Because ¥ is a left ideal of 7, x + LYrax + £ 1s a well-defined linear 
transformation on &Y. Also, |lax + ||? = (ax + Y,axt+ LY) = 
f(x*a*ax). Now if ||a*a|| is considered as an element of 7 (it is a multiple 
of the identity), then an appeal to the functional calculus for a*a shows that 
\|a*a|| — a*a > 0. Hence (Exercise 3.7) 0 < x*(||a*a|| — a*a)x = |la||?x*x 
— x*a*ax; that is, x*a*ax < |la\|?x*x. Therefore |jax + L||* < |lal|?f(x*x) 
= |lal|7\|x + L||?. Thus if 7,(a): &//L— A/L is defined by 7,(a)(x + #) 
= ax + £, m,(a) is a bounded linear operator with ||7,(a)|| < ||a||. Hence 
m,(a) extends to an element of #(.#;). It is left to the reader to verify that 
m,. SZ —> B(HA,) is a representation. 

Pute=1+ # in #. Then a,(#)e = (a+ 2: ae WL} = A/L which, 
by definition, is dense in #;. Thus e is a cyclic vector for 7,. [Also note that 
(m,(a)e,e) = f(a).] This proves (a). 

Now let (7, #), e, and f be as in (b) and let (7,, #,) be the representa- 
tion constructed. Let e, be the cyclic vector for 7, so that f(a) = 
(m,(a)e,,e,) for all a in &. Hence (7;(a)e;,e,) = (m(a)e,e) for all a in 
x. Define U on the dense manifold 7,(.7 )e, in #, by Un,(a)e,= m(a)e. 
Note that ||a(a)el|? = (a(a)e, t(a)e) = (a(a*a)e,e) = (,(a*a)e;, e;) 
= ||7,( a)e,||*. This implies that U is well defined and an isometry. Thus U 
extends to an isomorphism of # onto #. If x,a © W, then Un,(a)7,/(x Je, 
= Un,(ax )e,= m(a)a(x)e = 1(a)Um,(x Je. Thus a(a)U = Un;(a) so 
that 7 and 7, are equivalent. i 


The Gelfand—Naimark—Segal construction is often called the GNS con- 
struction. 

It is not difficult to show that if f is a positive linear functional on . and 
a > Q, then the representations 7, and 7,, are equivalent (Exercise 8). So it 
is appropriate to only consider the cyclic representations corresponding to 
states. If <M is a C *-algebra, let S,,= the collection of all states on /. Note 
that SC ball w#*. S,, is called the state space of %. 


5.15. Proposition. [f 2 is a C*-algebra with identity, then S,, is a weak* 
compact convex subset of ¥* and ifa = W,, then |\|a|| = sup{ f(a): fe Sy} 
and this supremum is attained. 


PROOF. Since S,,C ball .#*, to show that S,, is weak* compact, it suffices 
to show that S,, is weak* closed. The reader can supply this proof using 
nets. Clearly S,, is convex. 
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If = C(X) with X compact and f © C(X),, then there is a point x in 
X such that f(x) = [fll Thus |/fl| = ff dd, = sup{ [fdu: p< 
(ball M(X)),}. If # is arbitrary and a &w#,, then |la|| > sup{ f(a): 
f = S,}. Also, from the argument in the abelian case, there is a state f; on 
C*(a) such that f,(a@) = |la||. If we can show that f, extends to a state f on 
&, the proof is complete. That this can be done is a consequence of the next 
result. a 


5.16. Proposition. Let 2%, B be C*-algebras with BC &. If f, is a state on 
B, then there is a state f on & such that f|B = fy. 


Proor. Consider the real linear spaces Re and Re@. If a & #,, then 
a < |ja|| in 7. Since 1 © Re#W, Re# has an order unit. By Corollary 
III.9.12, if f, © Sg there is a positive linear functional f on Rew such that 
f|ReB = f,. Since e € &, f(1) = f,(1) = 1. Now let f(a) = f((a + a*)/2) 
+ if((a — a*)/2i) for an arbitrary a in &. It follows that fe S, and 
{=f i 


The next result says that every C *-algebra is isomorphic to a C *-algebra 
contained in #(#) for some #. Thus each C *-algebra “is” an algebra of 
operators. 


5.17. Theorem. [f 2 is a C*-algebra, then there is a representation (1, #) 
of L such that w is an isometry. If ¥ is separable, then # can be chosen 
separable. 


PROOF. Let F be a weak* dense subset of S,, and let 7 = O{a,: fe F}, 
H= O{H,. f © F}. Thus |la||? = ||7(a)||? = sup;|iz,-(a)||*. If e, is the 
cyclic vector for 7, then lle fll? = (e,,€¢) = (m7 ()e,,e-) = f(1) = 1. Hence 
lim-(a)||? > IIn(a)erl|? = (m/(a*a)e,,e,) = f(a*a), and |lall? > |Ir(a)|l? 
> sup{ f(a*a): fe F}. Since F is weak* dense in S,,, Proposition 5.15 
implies sup{ f(a*a): f © F} = |la*a|| = |\a||?. Hence 7 is an isometry. 

If .% is separable, (ball .#*,wk*) is a compact metric space (V.5.1). 
Hence S,, is weak* separable so that the set F of the preceding paragraph 
can be chosen to be countable. Now if f © F, 7(.#)f is a separable dense 
submanifold in #%, since © is separable. Thus #% is separable. It follows 
that # is separable. | 


Actually, more can be said if is separable. In fact, every separable 
C *-algebra has a cyclic representation that is isometric (Exercise 12). 


EXERCISES 


1. Let Ww be a C*-algebra with identity and let 7: W/—> @(H) be a *-homomor- 
phism [but don’t assume that 7(1) = 1]. Let P, = 7(1). Show that P, is a 
projection and #, = P,# reduces 7(.). If 7,(a) = 7(a)|%, show that 7,: 
WS B(H,) is a representation. 
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2. Show that the representation in Example 5.4 is a cyclic representation and find 
all.of the cyclic vectors. 


3. Show that the representation in Example 5.5 is a cyclic representation and find 
all the cyclic vectors. 


4. If X is compact and yp is a positive measure on X, let 7,: C(X) — & L?()) 
be the representation defined in Example 5.5. If u,v are positive measures on X 
show that 7, ® 7, is cyclic if and only if piv. If piv, then 7, @7, is 
equivalent to 7, ,,. Also, 7{”) is not cyclic if n > 2. 


5. Verify the statements in Example 5.8. 


6. If @=C + B(H#) and 7: PM B(H) is the identity representation, show 
that 7‘~? is a cyclic representation. 


7. Fix a Banach limit LIM on /”(N) and let # be a separable Hilbert space with 
an orthonormal basis {e,,}. Define f: @(3/) > C by f(T) = LIM{(Te,, e,,)}. 
Show that f is a state on @(#). If 7, is the corresponding cyclic representa- 
tion, show that ker, = @)(#). Hence a, induces a cyclic representation of 
B(H#)/By(#) that is isometric. Is #; separable? 


8. If f is a positive linear functional on & and a € (0, 00), show that 7, and 7,, 
are equivalent representations. 


9. If a € MW, then a > 0 if and only if f(a) = 0 for every state f. 
10. If a € M and a # 0, then there is a state f on & such that f(a) # 0. 


11. If f is a state on & and 7, is the corresponding representation, then 7, is 
injective if and only if {x ©. W: f(x*x) = 0} = (0). 


12. If x is a separable C*-algebra and { f, } is a countable weak* dense subset of 
Sv, let f=2,2 "f,. Show that 7, is an isometry. 


CHAPTER IX 


Normal Operators on Hilbert Space 


In this chapter the Spectral Theorem for normal operators on a Hilbert 
space is proved. This theorem is then used to answer a number of questions 
concerning normal operators. In fact, the Spectral Theorem can be used to 
answer essentially every question about normal operators. 


§1. Spectral Measures and Representations 
of Abelian C*-Algebras 


Before beginning this section the reader should familiarize himself with the 
definitions and examples in (VIII.5.1) through (VIII.5.8). 

In this section we want to focus our attention on representations of 
abelian C*-algebras. The reason for this is that the Spectral Theorem and its 
generalizations can be obtained as a special case of such a theory. The idea 
is the following. Let N be a normal operator on #. Then C*(N) is an 
abelian C*-algebra and the functional calculus f>f(N) is a *-iso- 
morphism of C(o(N)) onto C*(N) (VIII.2.6). Thus f+ f(N) is a repre- 
sentation C(o(N)) ~ &(#) of the abelian C*-algebra C(o(N)). A diag- 
nosis of such representations yields the Spectral Theorem. 

A representation p: C(X) — @(4#) is a *-homomorphism with p(1) = 1. 
Also, ||o|| = 1 (VIII.1.11d). If f © C(X),, then f = g* where g € C(X),; 
hence p(f) = p(g)* = p(g)*(g) = 0. So p is a positive map. One might 
expect, by analogy with the Riesz Representation Theorem, that p(f) = 
[fdE for some type of measure E whose values are operators rather than 
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scalars. This is indeed the case. We begin by introducing these measures and 
defining the integral of a scalar-valued function with respect to one of them. 


1.1. Definition. If X is a set, (2 is a o-algebra of subsets of X, and #7 isa 
Hilbert space, a spectral measure for (X, 2, #@) isa function E: Q ~ B(#) 
such that: 


(a) for each A in (2, E(A) is a projection; 

(b) E(Q) = 0 and ECX) = 1; 

(c) E(A, 1 A,) = E(A,)E(A,) for 4, and A, in Q; 
(d) if {A,, }°°_, are pairwise disjoint sets from 12, then 


oO 


e{ U4] =D EAL). 


n=l] 


A word or two concerning condition (d) in the preceding definition. If 
{ E,,} is a sequence of pairwise orthogonal projections on #, then it was 
shown in Exercise II.3.5 that for each h in #, °°, FE (A) converges in # 
to E(h), where E is the orthogonal projection of # onto V{E,(#): 
n> 1}. Thus it is legitimate to write E = L°°_,E,. Now if 4, 1 4, =O, 
then (b) and (c) above imply that 0 = E(A,)FE(4,) = E(4,)£(4,); that is, 
E(A,) and E(A,) have orthogonal ranges. So if {4,}{° is a sequence of 
pairwise disjoint sets in (2, the ranges of { F(A,,)} are pairwise orthogonal. 
Thus the equation E(UA,) = LE(A,) in (d) has the precise meaning just 
discussed. 

Another way to discuss this is by the introduction of two topologies that 
will also be of value later. 


1.2. Definition. If 7 is a Hilbert space, the weak operator topology (WOT) 
on &(#) is the locally convex topology defined by the seminorms { p,, ,: 
h,k © #} where p, ,(A) = |( AA, k)|. The strong operator topology (SOT) 
is the topology defined on 4(#) by the family of seminorms { p;: 
h © # }, where p,(A) = ||AAll. 


1.3. Proposition. Let # be a Hilbert space and let { A,} be a net in B(H). 


(a) A, ~ A (WOT) if and only if (A,;h,k) > (Ah,k) for allh,k in #. 

(b) If sup,||A,|| < 0c and 7 is a total subset of #, then A; > A (WOT) if 
and only if (A;h,k) > (Ah,k) for allh,k in J. 

(c) A, ~ A (SOT) if and only if \|A;h — Ah|| > 0 for allh in #. 

(d) If sup,||A,|| < 00 and 7 is a total subset of #, then A, > A (SOT) if 
and only if ||A,;h — Ah|| ~ 0 for allh in J. 

(e) If # is separable, then the WOT and SOT are metrizable on bounded 
subsets of B(#). 
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PRooF. The proofs of (a) through (d) are left as exercises. For (e), let {h,,} 
be any countable total subset of ball 7. If A, B © @(#), let 


d,(A,B)= )/ 2°"|\(A — B)A,|, 
n=1 


d,,(A, B) = 2 2 NCA = B)h,, Le) B 


m,n=1 


Then d, and d,, are metrics on #(%). It is left as an exercise to show that 
d, and d,, define the SOT and WOT on bounded subsets of #(#). a 


1.4. Example. Let (X, 2, 1) be a o-finite measure space. If ¢ € L*(p), let 
M, be the multiplication operator on L*(p). Then a net {¢,} in L°(p) 
converges weak* to ¢ if and only if M, > M, (WOT). In fact, if f, g € 
L?(u) and $, >  weak* in L*(u), then (M, f,g) = [6,fEdu > Jofzdu 

= (M,f,g) since fg € L'(u). Conversely, if M,, > M, (WOT) and fe 
Lu), then f= g,Z), where g,,g, © L%(u). (Why?) So J¢,fdy = 
(Mg 81 82) > (M81, 82) = fofdp. 


1.5. Example. If { £, } is a sequence of pairwise orthogonal projections on 
H, then LYE, converges (SOT) to the projection of # onto V{ E,(#): 
we 1}. 

In light of (1.5), a spectral measure for (X, 2, 4) could be defined as a 
SOT-countably additive projection-valued measure. 


1.6. Example. Let X be a compact set. (2 = the Borel subsets of X, uw = a 
measure on 2, and #= L*(n). For A in Q, let E(A) = multiplication by 
x4, the characteristic function of A. E is a spectral measure for (X, 2, #). 


1.7. Example. If E is a spectral measure for (X, 2, 7), the inflation, E"”, 
of E, defined by E‘”(A) = E(A)™, is a spectral measure for (X, 2, 7°”). 


1.8. Example. Let X be any set, (2 = all the subsets of X, #= any sep- 
arable Hilbert space, and fix a sequence {x,,} in X. If {e,,e5, ...} is some 
orthonormal basis for #, define E(A) = the projection onto V{e,: x, € A}. 
E is a spectral measure for (X, 2, #). 


The next lemma is useful in studying spectral measures as it allows us to 
prove things about spectral measures from known facts about complex- 
valued measures. 


1.9. Lemma. /f E is a spectral measure for (X,Q, 3) and g,h © #, then 
E,.,(A) = (E(A)g,h) 


defines a countably additive measure on {2 with total variation < ||g\|\||h\|. 
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ProorF. That » = E, , as defined above, is a countably additive measure is 
left for the reader to verify. If 4,,..., 4, are pairwise disjoint sets in &2, let 
a, € C such that ja) = 1 ands KE(A, \g,h)| =a, jE(A, )g,h). So 
5 jlu(4;)| = Lja,(E(A,)g, A) = (L;E(A, Jar, gen) = I, E(A, Jo giliAll- 
Now CE (A, Jo, g: l<j<njisa Anite sequence of pairwise orthogonal 
vectors so that ID, E(A a,gll* = LE(A,)gll* = ||E(U"_jA,)gll? < Ilgil?s 
hence 1 |u(4,)| < IllllAll. Thus lull < sIlIlAl. a 


It is possible to use spectral measures to define representations. The next 
result is crucial for this purpose. It tells us how to integrate with respect to a 
spectral measure. 


1.10. Proposition. /f E is a spectral measure for (X,0, #) and ¢: X > C 
is a bounded St-measurable function, then there is a unique operator A in 
B(H#) such that if ¢>0 and {A,,...,4,} is an Q-partition of X with 
sup{|o(x) — $(x’)|: x, x’ € A, } < e for1 < k <n, then for any x, in A,, 


[4 E olsye(4) 


k=1 


Proor. Define B(g,h) = {odE, ,, for g,h in #. By the preceding lemma it 
is easy to see that B is a a squihneai form with |B(g,)| < |||, |g |All. 
Hence there is a unique operator A such that B(g,h) = (Ag,h) for all g 
and hin #. 

Let {A,,...,4,} be an Q2-partition satisfying the condition in the state- 
ment of the proposition. If g and A are arbitrary vectors in # and x, € A, 
for 1 < k <n, then 


(Ag, hy — ), (x, ){E(A,)8, 4) 


k=1 


| 


o(x) — o(x,)|d(E(x)g,h) 


lA 


Ef ols) — 66a dEO)8. 1) 


<efd\(E(x)g,h)| <ellgillial. 


The operator A obtained in the preceding proposition is the integral of > 
with respect to E and is denoted by 


ode. 


Therefore if g,h € # and 9 is a bounded §2-measurable function on _X, the 
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preceding proof implies that 


1.11 (| foae) s,m) = [$dE, ,. 


Let B(X,92) denote the set of bounded {2-measurable functions ¢: 
X — € and let ||@|| = sup{|o(x)|: x © X}. It is easy to see that BCX, (2) is 
a Banach algebra with identity. In fact, if $*(x) = $(x), then BCX, Q2) is 
an abelian C *-algebra. The properties of the integral {¢ dE are summarized 
by the following result. 


1.12. Proposition. Jf E is a spectral measure for (X,Q, 37) and p: B( X,Y) 
— B(H) is defined by p(¢) = {odE, then p is a representation of BCX, (2). 


PROOF. It will only be shown that p is multiplicative; the remainder is an 
exercise. Let @ and yy € C(X). Let e > 0 and choose a Borel partition 
{4,,...,4,} of X such that sup{|w(x) — w(x’)|: x,x’ Ee A,}<e for 
w =, Y or oy and forl1 <k <n. Hence, if x,<eA, (1 <k <n), 


|fooat ~ ¥ w(x) £(4,) 


for w = ¢, y, or dy. Thus, using the triangle inequality, 


fovae -( foae)| fo ae 


c+] ¥ oleoeta)-| E olspe(a)]/ E4624) 
+ [Eee PE vexpe 4,)|-[foae)( fvae)), 


But E(A,)E(A4,) = E(4,;14,) and {A,,...,4,} is a partition. So the 
middle term in this sum is zero. Hence 


\Jovae-( fo] foae]| 


a | i a(s)e(a S v(x,)E(A,) - fv 


f ¥4E |} s elt + loll + toll. 


+ s o(x,)E(A,) - fat] 


Since € was arbitrary, {py dE = ( {oy dE) fy dE). a 


1.13. Corollary. Jf X is a compact Hausdorff space and E is a spectral 
measure defined on the Borel subsets of X, then p: C(X) > B(H#) defined by 
p(u) = fudE is a representation of C(X). 
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The next result is the main result of this section and it states that the 
converse to the preceding corollary holds. 


1.14. Theorem. Jf p: C(X) > &4(#) is a representation, there is a unique 
spectral measure E defined on the Borel subsets of X such that 


p(u) = fude 


for every uin C(X). 


Proor. The idea of the proof is similar to the idea of the proof of the Riesz 
Representation Theorem for linear functionals on C( X). We wish to extend 
p to a representation p: B(X) — @(#), where B(X) is the C*-algebra of 
bounded Borel functions. The measure FE of a Borel set A is then defined by 
letting E(A) = 6(x,). In fact, it is possible to give a proof of the theorem 
patterned on the proof of the Riesz Representation Theorem. Here, how- 
ever, the proof will use the Riesz Representation Theorem to simplify the 
technical details. 

If gh & #, then u (p(u)g,h) is a linear functional on C(X) with 
norm < ||g||||/||. Hence there is a unique measure, p, ,, in M(X) such that 


1.15 (p(u)g,h) = fudu,., 


for all uw in C(X). It is easy to verify that the map (g,h)'> py, , 1s 
sesquilinear (use uniqueness) and |p, || < |lgl|||l|. Now fix @ in BCX) and 
define [g,h] = ody, ,. Then [-,-] is sesquilinear form and |[g,/]| < 
li>|I || g||||2||. Hence there is a unique bounded operator A such that [g, h] = 
(Ag,h) and ||A|| < |||] (II-2.2). Denote the operator A by p(¢). So 9: 
B(X) > &(#) is a well-defined function, ||6(¢)|| < ||@||, and for all g,h 
in #, 


1.16 (P($)8,h) = fodu,». 


1.17. Claim. 6p: B(X) — 4(#) is a representation and p|C(X) = p. 


The fact that 6(u) = p(u) whenever u € C(X) follows immediately from 
(1.15) and (1.16). If ¢ © B(X), consider @ as an element of M(X)* 
(= C(X)**); that is, @ corresponds to the linear functional p> /o@dp. By 
Proposition V.4.1, {u € C(X): ||ul| < |||} is oc MCX)*, MCX)) dense in 
{L © M(X)*: ||L|| < |||}. Thus there is a net {u,;} in C(X) such that 
\lz,|| < |||] for all wu; and fu;dp > {dp for every win M(X). If py © BCX), 
then yp © M(X) whenever p € M(X). Hence fu;p du — {oy du for every 
y in B(X) and p in M(X). By (1.16), 6(u;y) > p( oY) (WOT) for all y in 
B(X). In particular, if ~ © CCX), then p(¢y) = WOT — lim p(u,V) = 
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WOT — lim p(u;)p(Y) = 6() p(y). That is, 


P( oy) = 6(>)0(y) 
whenever ¢ € B(X) and yy € C(X). Hence 6(u;)) = p(u;)p() for any y 
in B(X) and for all u;. Since p(u;) ~ 6p(¢) (WOT) and p(u,y) > p( oy) 
(WOT), this implies that 

B(oy) = 6(o) p(y) 
whenever ¢, ~ € B(X). 

The proof that p is linear is immediate by (1.16). To see that 6(¢)* = 
p(>), let {u,;} be the net obtained in the preceding paragraph. 
If u © M(X), let fi be the measure defined by (4) = (A). Then p(u,) > 
p(¢) (WOT) and so p(u,;)* > p(¢)* (WOT). But fu,du = fu;du > fodp 
= [odp for every measure p. Hence p(u;) > 6(¢). But p(u,;)* = p(u;) 
since p is a *-homomorphism. Thus 6(¢)* = p(#) and is a representa- 
tion. 

For any Borel subset A of X let E(A) = p(x,). We want to show that E 
is a spectral measure. Since x, is a hermitian idempotent in B(X), E(A) is 
a projection by (1.17). Since x, = 0 and x, = 1, E(Q) = Oand E(X) = 1. 
Also, E(4, 1 4,) = P(Xa,na,) = P(Xa Xa.) > E(4,)E(4,). Now let 
{4,,} be a pairwise disjoint sequence of Borel sets and put A, = U?_,,,4,. 
It is easy to see that E is finitely additive so if h € #, then 


= (E(A, )h, E(A,)h) 


n 


| Ua,)s — )} E(A,)h 


k=1 


= (E(A,)h,h) 
= (O(xa, Ash) 


[xs, 4a 


» Hn n(A,) > 0 


k=n+1 


as n — oo. Therefore E 1s a spectral measure. 

It remains to show that p(u) = fudE. It will be shown that 6(¢) = {¢dE 
for every ¢ in B(X). Fix » in B(X) and e > 0. If {A,,...,A,} is any 
Borel partition of X such that sup{|¢(x) — $(x’)|: x, x’ € A,} <e for 
1<k <n, then || — Lf-16(%,)Xa,lleo < € for any choice of x, in Ay. 
Since ||A|| = 1, e > {]6(¢) — L%_19(x,)E(A,)I|. This implies that p() = 
/odE for any ¢ in B(X). 

The proof of the uniqueness of E is left to the reader. | 


EXERCISES 
1. Prove Proposition 1.3. 


2. Show that ball @(.#) is WOT compact. 


268 IX. Normal Operators on Hilbert Space 


3. Show that Re 4@(#) and @(#), are WOT and SOT closed. 


4. If L: @(#) > Cis a linear functional, show that the following statements are 
equivalent: (a) L is SOT-continuous; (b) L is WOT-continuous; (c) there are 
vectors hy,...,4,,81,---58, i % such that L(A) = L?_|(Ah,, g;). 


5. Show that a convex subset of @(#) is WOT closed if and only if it is SOT 
closed. 


6. Verify the statement in Example 1.5. 
7. Verify the statements made in Examples 1.6, 1.7, and 1.8. 


8. For the spectral measures in (1.6), (1.7), and (1.8), give the corresponding 
representations. 


9. If { £,} is a net of projections and E is a projection, show that E,; ~ E (WOT) 
if and only if FE; — E (SOT). 


10. For the representation in (VIII.5.5), find the corresponding spectral measure. 


11. In Example VIII.5.4, the representation is not quite covered by Theorem 1.14 
since it is a representation of L*(p) and not C( X). Nevertheless, this represen- 
tation is given by a spectral measure defined on 2. Find it. 


12. Let X be a compact Hausdorff space and let {x,,} be a sequence in X. Let {e,,} 
be an orthonormal basis for # and for each u in C(X) define p(u) in 4(#) 
by p(u)e, = u(x,,)e,. Show that p is a representation and find the correspond- 
ing spectral measure. 


13. A representation p: > B(#) is irreducible if the only projections in B(#) 
that commute with every p(a), a in &, are 0 and 1. Prove that if is abelian 
and p is an irreducible representation of #%, then dim #= 1. Find the corre- 
sponding spectral measure. 


14. Show that a representation p: C(X) —- @(#) is injective if and only if 
E(G) # 0 for every open set G, where E is the corresponding spectral measure. 


15. Let {A,} be a net of hermitian operators on # and suppose that there is a 
hermitian operator T such that A; < T for all i. If {(A,;h,h)} is an increasing 
net in R for every h in #, then there is a hermitian operator A such that 
A, ~ A (WOT). 


16. Show that there is a contraction tr: @(#)** > @(#) such that 7(T) = T for 
T in @(#). lf p: C(X) ~ B(#) is a representation, show that the map p in 
the proof of Theorem 1.14 is given by p(¢) = 7 ° p**(¢). 


§2. The Spectral Theorem 


The Spectral Theorem is a landmark in the theory of operators on a Hilbert 
space. It provides a complete statement about the nature and structure of 
normal operators. This accolade will be seen to hold when in Section 10 it is 
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used to give a complete set of unitary invariants. Two operators A and B 
are unitarily equivalent if there is a unitary operator U such that UAU* = B; 
in symbols, A = B. Using the Spectral Theorem, a (countable) set of objects 
are attached to a normal operator N on a (separable) Hilbert space. It is 
then shown that two normal operators are unitarily equivalent if and only if 
these objects are equal. 

The Spectral Theorem for a normal operator N on a Hilbert space with 
dim #= d < oo says that N can be diagonalized. That 1s, if a,,...,a@, are 
the eigenvalues of N (repeated as often as their multiplicities), then the 
corresponding eigenvectors e,,e5,...,e€, form an orthonormal basis for #. 
In infinite-dimensional spaces a normal operator need not have eigenvalues. 
For example, let N = multiplication by the independent variable on L7(0, 1). 
So an alternate formulation that can be generalized is desired. 

Let N be normal on #, dim #= d < oo. Let Aj,...,A,, be the distinct 
eigenvalues of N and let E, be the orthogonal projection of # onto 
ker(N — A,), 1 < k <n. Then the Spectral Theorem says that 


2.1 N= > ),E,. 


In this form a generalization is possible. Rather than discuss orthogonal 
projections on eigenspaces (which may not exist), the concept of a spectral 
measure is used; rather than the sum that appears in (2.1), an integral is 
used. It is worth mentioning that the finite-dimensional version is a corollary 
of the general theorem (see Exercise 4). 


2.2. The Spectral Theorem. Jf N is a normal operator, there is a unique 
spectral measure E on the Borel subsets of o( N) such that: 


(a) N= fzdE(z); 

(b) if G is a nonempty relatively open subset of o(N), E(G) # 0; 

(c) if A © B(H#), then AN = NA and AN* = N*A if and only if AE(A) = 
E(A)A for every A. 


PrRoor. Let w= C*(N), the C*-algebra generated by N. So is the 
closure of all polynomials in N and N*. By Theorem VIII.2.6, there is an 
isometric isomorphism p: C(o(N)) > WC B(H) given by p(u) = u(N) 
(the functional calculus). By Theorem 1.14 there is a unique spectral 
measure E defined on the Borel subsets of o( N) such that p(u) = fudE for 
all u in C(o(N)). In particular, (a) holds since N = p(z). 

If G is a nonempty relatively open subset of o( NV), there is a nonzero 
continuous function u on o(N) such that 0 < u < x,. Using Claim 1.17, 
one obtains that E(G) = p(x,¢) = p(u) # 0; so (b) holds. 

Now let A € @(#) such that AN = NA and AN* = N*A. It is not 
hard to see that this implies, by the Stone—Weierstrass Theorem, that 
Ap(u) = p(u)A for every u in C(o(N )); that is, Au(N) = u(N)A for all u 
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in C(o(N)). Let 2 = {A: A is a Borel set and AE(A) = E(A)A}. It is left 
to the reader to show that 2 1s a o-algebra. If G is an open set in o(N), 
there is a sequence {u,,} of positive continuous functions on o(N) such that 
u,(z)1T x,(2) for all z. Thus 


(AE(G)g,h) = (E(G)g, A*h) 
a E, ax(G) 


=lim fu, dE, 4; 


= lim(u,(N)g, A*h) 
= lim( Au,(N)g,h) 
= lim(u,(N) Ag, h) 


= (E(G)Ag,h). 
So 92 contains every open set and, hence, it must be the collection of Borel 
sets. The converse is left to the reader. a 


The unique spectral measure F obtained in the Spectral Theorem is called 
the spectral measure for N. An abbreviation for the Spectral Theorem is to 
say, “Let N = {A dE(A) be the spectral decomposition of N.” If ¢ is a 
bounded Borel function on o(N), define ¢(N ) by 


O(N) = ode, 
where E is the spectral measure for N. 


2.3. Theorem. Jf N is a normal operator on # with spectral measure E and 
B(o(N)) is the C*-algebra of bounded Borel functions on o(N ), then the map 
p> o(N) 
is a representation of the C*-algebra B(o(N)). If {$,} is a net in B(o(N )) 
such that {¢,dp — 0 for every p in M(o(N)), then o(N) > 0 (WOT). This 
map is unique in the sense that if t: B(o(N)) > B(#) is a representation 
such that t(z)= WN and 1(¢,) > 0 (WOT) whenever {9;} is a net in 
B(o(N)) such that {¢,dp — 0 for every p in M(o(N)), then 7(o) = o(N) 

for all @ in B(o(N)). 


Proor. The fact that ¢ — $(N) is a representation is a consequence of 
Proposition 1.12. If {¢,} is as in the statement, then the fact that FE, , © 
M(o(N)) implies that ¢,(N) ~ 0 (WOT). 

To prove uniqueness, let 7: B(o(N)) ~ &(#) be a representation with 
the appropriate properties. Then t(u) = u(N) if u © C(o(N)) by the 
uniqueness of the functional calculus for normal elements of a C*-algebra 
(VIII.2.6). If @¢ € B(o(N)), then Proposition V.4.1 implies that there is a 
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net {u,;} in C(o(N)) such that ||u;|| < ||| for all u; and fu,;du > fodp 
for every p in M(o(N)). Thus u,(NV) > o(N) (WOT). But 7(¢) = WOT 
— limt(u,) = WOT — limu,(N); therefore 7(¢) = o(N). el 


It is worthwhile to rewrite (1.11) as 


2.4 (o(N)g,h) = [odE,, 


for ¢ in B(o(N)) and g,h in #. If & € B(C), then the restriction of ¢ to 
o(N) belongs to B(o(N)). Since the support of each measure E, , is 
contained in o(N), (2.4) holds for every bounded Borel function ¢ on C. 
This has certain technical advantages that will become apparent when we 
begin to apply (2.4). 

Proposition 2.3 thus extends the functional calculus for normal operators. 
This functional calculus or, equivalently, the Spectral Theorem, will be 
exploited in this chapter. But right now we look at some examples. 


2.5. Example. If u is a regular Borel measure on C with compact support 
K, define N, on L7(u) by Nf = zf for each f in L?(p). It is easy to check 
that N*f = zf, and, hence, N, 1s normal. 


(a) o(N,) = K = support of p. (Exercise.) 
(b) If, for a bounded Borel function ¢, we define M, on L?(p) by M,f =f, 
then $(N,) = Mg. 


Indeed, this is an easy application of the uniqueness part of (2.3). 
(c) If E is the spectral measure for N,, then E(A) = M, . 
Just note that E(A) = x,(N). 


2.6. Example. Let (X,0, 1) be any o-finite measure space and put #= 
L*(X,Q,w). For ¢ in L*(") = L*(X,Q, w), define M, on # by M,f = of. 


(a) M, is normal and M¥ = Mg (II.2.8). 
(b) 6 > M, is a representation of L°(y) (VIII.S.4). 


(c) Ifo = L(p), [lol]. = |]Mgl| (11-15). 
(d) Define the essential range of ¢ by 


ess-ran(¢) = N{cl(o(A)): A € Q and wp(X\ A) = 0}. 


Then o( M,) = ess-ran(¢). (This appears as Exercise VII.3.3, but a proof 
is given here.) 

First assume that A € ess-ran(¢). So there is a set A in Q2 with w(X \ A) 
= 0 and A not in cl(¢(A)); thus, there is a 6 > 0 with |6(x) — Aj > 6 for 
all x in A. If P=(¢—2A)7*, P ] L*(m) and M, = (M, — A)7*. 

Conversely, assume A &€ ess-ran(¢). It follows that for every integer n 
there is a set 4, in (2 such that 0 < p(A,) < 00 and |¢(x) — A] < 1/n for 
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all x in A,. Put f, = (u(4,))"'/°x4 3 so f, © L*(w) and |If,|l, = 1. How- 
ever, ||(M, — A)f,Il? = (u(A4,,))” “fa 1 — Al’ du < 1/n’, showing that \ © 
0.,,(M,). 


(e) If E is the spectral measure for M, [so E is defined on the Borel subsets 
of o(M,) = ess-ran(¢) € C], then for every Borel subset 4 of 0(M,), 
E(A)= M,-1(4)- 


2.7. Proposition. Jf for k =>1, N, is a normal operator on XH, with 
sup,||N,|| < 00, E, is the spectral measure for N,, and if N= ®/°_,N, on 
H= OF _,H,, then: 


(a) o(N) = cl[UP_,0(N, )I; 
(b) if E is the spectral measure for N, E(A)= ®7_,E,(AN o(N,)) for 
every Borel subset A of o(N). 


PROOF. Exercise. 


A historical account of the spectral theorem is an enormous undertaking 
by itself. One such account is Steen [1973]. You might also consult the notes 
in Dunford and Schwartz [1963] and Halmos [1951]. 


EXERCISES 
Throughout these exercises, N is a normal operator on # with spectral measure E. 


1. Show that A €o,(N) if and only if E({A}) # 0. Moreover, if A €o,(N), 
E({A}) is the orthogonal projection onto ker(N — A). 


2. If A is a clopen subset of o(N), show that E(A) is the Riesz idempotent 
associated with A. 


. Prove Theorem II.5.1 and its corollaries by using the Spectral Theorem. 
. Prove Theorem II.7.6 and its corollaries by using the Spectral Theorem. 
. Obtain Theorem II.7.11 as a consequence of (2.3). 

. Verify the statements in Example 2.5. 

. Verify (2.6e). 


oOo YN KN TW Ee WD 


. Let A be a hermitian operator with spectral measure E on a separable space. 
For each real number f¢ define a projection P(t) = E(— 00, t). Show: 
(a) P(s) < P(t) fors <4; 
(b) if¢, <¢,,, andzt, > ¢, P(t,) ~ P(t) (SOT); 
(c) for all but a countable number of points t, P(t,,) ~ P(t) (SOT) if t, > 6 
(d) for f in C(o(A)), f(A) = {~%,f() dP(t), where this integral is to be 
defined (by the reader) in the Riemann-Stieltjes sense. 


9. Show that a normal operator N is (a) hermitian if and only if o(N) C R; (b) 
positive if and only if o(N) C [0, 00); (c) unitary if and only if o(N) C oD. 
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10. 


11. 


12. 


13. 
14. 
15. 


16. 


17. 


18. 


19. 


Show that if # is separable, there are at most a countable number of points 
{z, } in o(N) such that E(z,) # 0. By Exercise 1, these are the eigenvalues of 
N. 


Show that E(o(N)\o,(N)) = 0 if and only if N is diagonalizable; that is, 
there is a basis for # consisting of eigenvectors for N. 


Show that if N = U|N| ((N| =(N*N)'”) is the polar decomposition of N, 
U = $(N) for some Borel function ¢ on @(N). Hence U|N| = |N|U. 


Show that N = W|N| for some unitary W that is a function of N. 
Prove that if A is hermitian, exp(iA) is unitary. Is the converse true? 


Show that there is a normal operator M such that M7 = N and M = $(N) for 
some Borel function ¢. Is there only one such normal operator? 


Define N: L7(R) > L*(R) by (Nf)(t) = f(t + 1). Show that N is normal and 
find its spectral decomposition. 


Suppose that N,,...,N, are normal operators such that N;N* = N,*N, for 
1 < j,k < d. Show that there is a subset X of C4 and a spectral measure E 
defined on the Borel subsets of X such that N, = fz, dE(z) for l1<k<d 
(z, = the kth coordinate function)(see Exercise VIII.2.2). 


If N,,...,N, are as in Exercise 17 and each is compact, show that there is a 
basis for # consisting of eigenvectors for each N,. (This is the simultaneous 
diagonalization of N,,..., Ny.) 


This exercise gives the properties of Hilbert-Schmidt operators (defined below). 
(a) If {e,} and { f,} are two orthonormal bases for # and A © 4(#), then 


Lil4e I = DIAgll = Lda fd! 
(b) If A € @(#) and {e,} is a basis for #, define 


1/2 
All. = [Enea ; 


i 


By (a) ||A||, 1s independent of the basis chosen and hence is well defined. If 
||A ||, < 00, A is called a Hilbert-Schmidt operator. B, = @,(#) denotes the 
set of all Hilbert—-Schmidt operators. (c) ||A|| < ||A||, for every A in @(#) and 
| - ||, 18 a norm on @,. (d) If TE @= BH) and A € @,, then ||TA||, < 
WTUMAll, WA* lla = Alla and ||ATI| < |IAllIITII. (©) B is an ideal of @ that 
contains &,,, the finite-rank operators. (f) A © @, if and only if |A| = 
(A*A)'”? & B,; in this case ||A]], = |||A]|l,. (g) @. C By; moreover, if A is a 
compact operator and 4,,A,,... are the eigenvalues of |A|, each repeated as 
often as its multiplicity, then A © @,(#) iff L°%_,M, < oo. In this case, 
|A ||, = (2A%,)'”7. (h) If CX, Q, w) is a measure space and k € L?(p X p), let K: 
L?(p) > L?(p) be the integral operator with kernel k. Then K € @,(L7(p)) 
and ||K||, = |[A||, (see Proposition II.4.7 and Lemma II.4.8). (1) Interpret part 
(h) for a purely atomic measure space. More information on &, is contained in 
the next exercise. 
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20. This exercise discusses trace-class operators (defined below) and assumes a 


21. 


D2 


23. 


knowledge of Exercise 19. @,(4) = {AB: A and B © &,(#)}. Operators 
belonging to &,(#) are called trace-class operators and @,(4#) = &, is called 
the trace class. (a) If A € @,(#) and {e,} is a basis, then )|(Ae,, e,)| < oo. 
Moreover, the sum Li( Ae;, e;) is independent of the choice of basis. (Hint: If 
A =C*B, B,C in &,, show that |(Ae,,e,)| = 3(||Be,||7 + ||Ce,|I*).) (b) If 
{e,} is a basis for #, define tr: @, — C by 


tr( A) = XL (Ae;,e;)- 


By (a) the definition of tr(A) does not depend on the choice of a basis; tr(A) is 
called the trace of A. If dim #< oo, then tr(A) 1s precisely the sum of the 
diagonal terms of any matrix representation of A. (c) If A € @(#), then the 
following are equivalent: (1) A € &,; (2) |A| = (A*A)'” © &,; (3) JA’? € 
B,; (4) tr(JA]) < oo. (d) If A € @ and T € &, then AT and TA are in &, and 
tr(AT) = tr(TA). Moreover, tr: @, — C is a positive linear functional such that 
if A € @,, A = 0, and tr(A) = 0, then A = 0. (e) If A © &,, define |{A||,; = 
tr(|A]). If A&B, and T€ &, show that |tr(TA)| < ||TIAll)- (f) All = 
||A* ||, if A eB. (g) If TE 4 and AE &@,, then ||TA||, < ||7||IAl|, and 
ATI], < UTHMAT. Ch) || - ll, is a norm on &,. It is called the trace norm. (i) 
Z, is an ideal in #(#) that contains By. (j) If A € B, and (e,} and { f,} are 
two bases for #, then X,|(Ae,, f;)| < ||Al];. (d) 4, © B. Also, if A € B and 
A,,A5,--. are the eigenvalues of |A|, each repeated as often as its multiplicity, 
then A & &, if and only if Y°_,A,, < oo. In this case, ||A ||, = L9_,A,. (1) If A 
and B © &,, define (A, B) = tr( B*A). Then (-, -) is an inner product on 4, 
|| - |, is the norm defined by this inner product, and &, is || - ||, complete. In 
other words, #, is a Hilbert space. (m) (&,, || - ||,) is a Banach space. (n) Boy 
is dense in both @, and @,. (For more on these matters, see Ringrose [1971] 
and Schatten [1960].) 


This exercise assumes a knowledge of Exercise 20. If g,hEx#H, let g @h 
denote the rank-one operator defined by (g ®h)(f) = (f,h)g. (a) lf g,h © 
and A€ &(H#), tr(A(g @h)) = (Ag h). (6) If TE GZ, then ||T\|, = 

sup{|tr(CT)|: C € B, ||C|| < 1}.(O If T € @&, define L;: B — C by L;(C) 
= tr(TC) (= tr(CT)). Show that the map T > L, is an isometric isomorphism 
of @, onto B*. (d) If B € &, define Fz: @, — C by F(T) = tr( BT). Show 
that B > F, is an isometric isomorphism of # onto &*. (e) If L € @* show 
that L = L, + L, where Ly, L, © @*, L,(B) = tr( BT) for some T in &,, and 
L,(C) = 0 for every compact operator C. Show that |{L|| = ||Lol| + ||L,|| and 
that L, and L, are unique. 


Prove that if U is any unitary operator on #, then there is a continuous 
function u: [0,1] ~ @(#) such that u(t) is unitary for all r, u(0) = U, and 
u(1) = 1. 


If N is normal, show that there is a sequence of invertible normal operators that 
converges to N. 
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§3. Star-Cyclic Normal Operators 


Recall the definition of a reducing subspace and some of its equivalent 
formulations (Section IT.3). 


3.1. Definition. A vector eg in # is a star-cyclic vector for A if # is the 
smallest reducing subspace for A that contains e,. The operator A is star 
cyclic if it has a star-cyclic vector. A vector e, 1s cyclic for A if # is the 
smallest invariant subspace for A that contains e,; A is cyclic if it has a 
cyclic vector. 


3.2. Proposition. (a) A vector e, is a star-cyclic vector for A if and only if 
H= cl{Te): T = C*(A)}, where C*(A) = the C*-algebra generated by A. 
(b) A vector e, is a cyclic vector for A if and only if #= cl{ p(A)eo: p= a 
polynomial \. 


PROOF. Exercise. 


Note that if e, is a star-cyclic vector for A, then it is a cyclic vector for 
the algebra C*(A). 


3.3. Proposition. Jf A has either a cyclic or a star-cyclic vector, then # is 
separable. 


PROOF. It is easy to see that C*(A) and { p(A): p =a polynomial} are 
separable subalgebras of @(#). Now use (3.2). Le] 


Let » be a compactly supported measure on C and let N, be defined on 
L*() as in Example 2.5. If K = support p, then C*(N,) = {M,: ue 
C(K)}. Since C(K) is dense in L?(), it follows that 1 is a star-cyclic 
vector for N,,. The converse of this is also true. 


3.4. Theorem. 4 normal operator N is star-cyclic if and only if N is unitarily 
equivalent to N, for some compactly supported measure pw on C. If eg is a 
star-cyclic vector for N, then p can be chosen such that there is an isomorphism 
V: #2 L?*(p) with Ve, = 1 and VNV* = N,- Under these conditions, V is 
unique. 


ProoF. If N = N,, then we have already seen that N is star cyclic. So 
suppose that N has a star-cyclic vector e,. If E is the spectral measure for 
N, put p(A) = ||E(A)eoll? = (E(A)eo, e) for every Borel subset A of C 
(see Lemma 1.9). Let K = support up. 
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If ¢ € B(K ), then (2.4) implies 
O(N )egll? = (O(N Jeo, O(N Jeo) 
i (lo? CN Jeo, eo) 


= fig(2)P A E(z)e,, €o) 


= fig? du. 


So if B(K) is considered as a submanifold of L*(), Vo = o(N )e, defines 
an isometry from B(K) onto {¢(N )e,: ¢ € B(K )}. But e, 1s a star-cyclic 
vector, so the range of V is dense in #. Hence V extends to an isomor- 
phism V: L?(p,) > #. 

If o€ ee then VN,V™ '(o(N ey) = VN i(P) = V(zd) = NO@(N Jeo. 
Hence VN, V-'!=N on {b(N es: ¢€ B(K)}, "which is dense in #. So 
VN, vo} = N. 

The proof of the uniqueness statement is an exercise. a 


Any theorem about the operators N, is a theorem about star-cyclic 
normal operators. With this in mind, the next theorem gives a complete 
unitary invariant for star-cyclic normal operators. But first, a definition. 


3.5. Definition. Two measures, 1, and p,, are mutually absolutely continu- 
ous if they have the same sets of measure zero; that is, u,(4) = 0 if and 
only if w,(A) = 0. This will be denoted by [p,] = [u,]. (The more standard 
notation in the literature is yw, = w,, but this seems insufficient.) If [u,] = 
[uw], then the Radon—Nikodym derivatives du,/du, and du,/dp, are well 
defined. Say that uw, and p, are boundedly mutually absolutely continuous if 
[u,] =[,.] and the Radon—Nikodym derivatives are essentially bounded 
functions. 


3.6. Theorem. N, = N,, if and only if [v4] = [M2]. 


PROOF. Suppose [},] = [#.] and put ¢ = dp, /dp,. Soif g © L(u,), go € 
L\(u) and {gp dp, = fedty, Haice, MTS L*(u,), Vof © L*(w,) and 
iV filo = ILfllz; that is, U: L?(u,) > L?() defined by Uf = of is an 
isometry. If g © L?(y,), then f = @ '/7g © L*(p,) and Uf = g; hence U is 
surjective and U''g=9 '/*g for g a L*(p,). If g © L?(p5), then 
UN,,U~'g = UN, @ '/*g = Uzo1/* g = zg, and so UN, U" ‘= N,.. 
Now assume “that V: L*(p,) > 72 (>) is an isomorphism such that 
VN, vet = N,,. Put P= V(1); so pe L*(p,). For convenience, put IN a 
Noe. ped. eas easy to see that VN*‘V~! = NF and VN,**V = Ni 


be; 
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Hence Vp(N,, N,*)V_* = p(N,, N3*) for any polynomial p in z and 2. 
Since N, = N,, o(N,) = o(N,); hence support p, = support wp, = K. By 
taking uniform limits of polynomials in z and z, Vu(N,)V~* = u(N,) for u 
in C(K). Hence for u in C(K), V(u) = Vu(N,)1 = u(N,)V1 = ud. Be- 
cause V is an isometry, this implies that f|u|* dy, = f|ul*|y|? du, for every 
u in C(K). Hence fudp, = fv|p|?du, for v in C(K), v > 0. By the 
uniqueness part of the Reisz Representation Theorem, p, = |¥|*p,, so 
By S bo. 

By using V~' instead of V and reversing the roles of N, and N, in the 
preceding argument, it follows that yp, < p,. Hence [p,] = [p5]. a 


EXERCISES 


1. If » is a compactly supported measure on C and f € L?(p), f is a star-cyclic 
vector for N if and only if w({ x: f(x) = 0}) = 0. 


2. Prove Proposition 3.2. 


3. If , and p, are compactly supported measures on C, show that the following 
statements are equivalent: (a) w, and pw, are boundedly mutually absolutely 
continuous; (b) there is an isomorphism V: L?(p,) > L?*(,) such that VN,V-* 
= N,, and VL*(p,) = L*(p); (c) there is a bounded bijection R: L*(p,) > 
L*(,) such that Rp(z,Z) = p(z,Z) for every polynomial in z and Z. 


4. Show that if N is a star-cyclic normal operator and A € 0,(N), then dim ker(N 
—A)=1. 


5. If N is diagonalizable and star cyclic and if 6,(N) = {A,,A2, ...}, show that N 
is unitarily equivalent to N,, where p = L77_,2° "8, (see Exercise 2.11). 


6. Let N be a diagonalizable normal operator. Show that N = M if and only if M 
is a diagonalizable normal operator, o,(N) = 6,(M), and dimker(N — A) = 
dim ker( M — A) for all A. (Compare this with Theorem II.8.3.) 


7. Let U be the bilateral shift on /*(Z). If e, is the vector in /*(Z) that has 1 in the 
zeroth place and zeros elsewhere, then e, is a star-cyclic vector for U. If w is the 
compactly supported measure on C and V: /?(Z) > L?() is the isomorphism 
such that Vey = 1 and VUV"' = N,, then 
(a) » = m = normalized arc length on 0D; 

(b) V-' = the Fourier transform on L?(m) = L?( dD). 


8. Suppose N,,...,N, are normal operators such that N,N* = NN, for 1 < j,k 
< d and suppose there is a vector ey in # such that #¥ is the only subspace of 
H# containing e, that reduces each of the operators N,,..., N,. Show that there 
is a compactly supported measure p on C% and an isomorphism V: #— L*(,1) 
such that VN,V~'f =z, f for f in L?(w) and 1 <k <d (z, = the kth coordi- 
nate function) (see Exercise 2.17). 
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§4. Some Applications of the Spectral Theorem 


In this section a few diverse applications of the Spectral Theorem are 
presented. These will show the power and finesse of the Spectral Theorem as 
well as demonstrate some of the methods used to apply it. One result in this 
section (Theorem 4.6) is more than an application. Indeed, many regard this 
as the optimal statement of the Spectral Theorem. 

If N is anormal operator and N = {zdE(z) is its spectral representation, 
then ¢ > $(N) = {dE is a *-homomorphism of B(C) into 4(.#). Thus, 
if $,y © B(C), ({odE) [y dE) = [oy dE and ||/pdEl| < sup{|o(z)|: z © 
o(N)}. 


4.1. Proposition. If N is a normal operator and N = {zdE(z), then N is 
compact if and only if for every « > 0, E({z: |z| > €}) has finite rank. 


ProoFr. If ¢ > 0, let A, = {z: |z| > e} and E, = E(A,). Then 


N — NE. = [zdE(z) 2 fexa(2) dE(z) 


7 [exeya(2) dE(z)=9(N) 


where $(Z) = zXe\a(z). Thus ||N — NE,|| < sup{|z|: ze© C\4,} <«. If 
E, has finite rank for every e > 0, then so does NE,. Thus N © #)(#). 

Now assume that N is compact and let e > 0. Put $(z) =z ‘x4(2); 
so @ € B(C). Since N is compact, so is No(N). But No(N) = 
fzz~'x,(z) dE(z) = E,. Since E, is a compact projection, it must have 
finite rank. (Why?) & 


The preceding result could have been proved by using the fact that 
compact normal operators are diagonalizable and the eigenvalues must 
converge to 0. 


4.2. Theorem. If # is separable and I is an ideal of B( HA’) that contains a 
noncompact operator, then I = B( HX). 


Proor. If A GJ and A ¢ &,(#), consider A*A; let A*A = [tdE(t) 
(o(A*A) C [0, 00)). By the preceding proposition, there is an e > 0 such 
that P = E(e, 00) has infinite rank. But P = ({t~'x,,.)(t) dE(t)) A*A € I. 
Since # is separable, dim P#= dim H=%,. Let U: #H-> PH be a 
surjective isometry. It is easy to check that 1 = U*PU. But P © I, sol € I. 
Hence I = &@(#). ces 


In Proposition VIII.4.10, it was shown that every nonzero ideal of #(#) 
contains the finite-rank operators. When combined with the preceding 
result, this yields the following. 
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4.3. Corollary. If # is separable, then the only nontrivial closed ideal of 
B(H#) is the ideal of compact operators. 


The next proposition is related to Theorem VIII.5.9. Indeed, it is a 
consequence of it so that the proof will only be sketched. 

Let N be a normal operator on # and for every vector e in # let 
KH, = {N**N’e: k, 7 => 0}. So H is the smallest subspace of # that 
contains e and reduces N. Also, N|4, is a star-cyclic normal operator. 


4.4. Proposition. Jf N is a normal operator on #, then there are reducing 
subspaces { #7, i © I} for N such that #= © ,H, and N|H, is star cyclic. 


I U I 


ProoFr. Using Zorn’s Lemma find a maximal set of vectors € in # such 
that if e, fe & and e # f, then #, 1 #€. It follows that #= ©,%. isi 


4.5. Corollary. Every normal operator is unitarily equivalent to the direct sum 
of star-cyclic normal operators. 


By combining the preceding proposition with Theorem 3.4 on the repre- 
sentation of star-cyclic normal operators we can obtain the following 
theorem. 


4.6. Theorem. Jf N is a normal operator on #, then there is a measure 
space (X,92,) and a function o in L®(X,Q,p) such that N is unitarily 
equivalent to M, on L*(X, Q, p). 


PROOF. If .@ is a reducing subspace for N then N = N|.4@ N|.4@*; thus 
o(N|.4@) € a(N). So if { N;} is a collection of star-cyclic normal operators 
such that N = @,N, (4.5), then o(N,) C o(N) for every N,. By Theorem 3.4 
there is a measure p, supported on o(N) such that N, = N,. Let X; = the 
support of uw, and let 92, = the Borel subsets of X;,. Let X = the disjoint 
union of { X,}. Define {2 to be the collection of all subsets A of X such that 
AM X, © Q, for all i. It is easy to check that 2 is a o-algebra. If A € Q let 
u(A) = Xu (AO X,); then (X, Q, w) is a measure space. If f € L7(X, Q, ») 
then f, = f|X, © L?(;). Moreover, the map U: L*(w) > @,L7(p,) defined 
by Uf = ©,(/|X;) is easily seen to be an isomorphism. Define ¢: X — C by 
letting $(z)=z if ze X, (CC); since X, C o(N) for every i, @ is a 
bounded function. If G is an open subset of C, 6° '(G)N X, = GN X,€ Q;; 
hence ¢ is {2-measurable. Therefore ¢ € L*CX, Q, 1). It is left to the reader 
to check that UM,U"' = @,N, =N. & 


4.7. Proposition. [f is separable, then the measure space in Theorem 4.6 
is o-finite. 


PROOF. This is true because if L?7( X, 2, w) is separable, then (X, 2, ») must 
be o-finite. Indeed, let & be a collection of pairwise disjoint sets from 2 
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having nonzero measure. A computation shows that {("(A))'/°x 4: 4 € &} 
are pairwise orthogonal vectors in L?(p). If L?() is separable, then & 
must be countable. Therefore (_X, 92, w) is o-finite. a 


Of course if (X,{,) is finite it is not necessarily true that L?(p) is 
separable. 

The next result will be useful later in this book and it also provides a 
different type of application of the Spectral Theorem. 


4.8. Proposition. Jf ./ is an SOT-closed C*-subalgebra of B( 4), then A 
is the norm closed linear span of the projections in &. 


Proor. If A €.a%, A + A* and A — A* € W; hence # is the linear span 
of Rev. Suppose A © Rew and A = ftdE(t). If [a,b] C R, then there is 
a sequence {u,} in C(R) such thatO <u, < 1, u,(t)=1 fora<t<b- 
n-', u,(t)=0 for t<a—n™' and t>b. Hence u,(t) > x,,,4)(t) as 
n— oo. lf h € #, then 


IM @(A) — Ela, b)}AU? = flan(e) — xp a.) (4)? dE, (1) > 0 


by the Lebesgue Dominated Convergence Theorem. That is, u,(A) > 
E[a, b) (SOT). Since 7 is SOT-closed, F[a, b) € x. Now let (a, 8) be an 
open interval containing o( A). If « > 0, then there is a partition {a = ft, < 

- <t, = B} such that |t — Lf atx E,,_,.1,(| < © for ¢ in o( A); hence 
||A — UP jt, E[t,_1,¢,)|| < ¢. Thus every self-adjoint operator in  be- 
longs to the closed linear span of the projections in «7. is 


EXERCISES 


1. If N is a normal operator show that ran N is closed if and only if 0 is an isolated 
point of a(N). 


2. Give an example of a non-normal operator A such that 0 is an isolated point of 
o(A) and ran A is closed. Give an example of a non-normal operator B such that 
ran B is closed and 0 is not an isolated point of o( B). 


3. If # is a nonseparable Hilbert space find an example of a nontrivial closed ideal 
of @(#) that is different from @)(#%). 


4. Let (X,2,) be the measure space obtained in the proof of Theorem 4.6 and 
show that L'(X,Q,)* is isometrically isomorphic to L”( X, Q, 1). 


5. Show that # is separable if and only if every collection of pairwise orthogonal 
projections in @(#) is countable. 


6. If (X,2,) is a measure space, then (X,{2,) is o-finite if and only if every 
collection of pairwise orthogonal projections in { M,: ¢ © L*()} is countable. 


7. If N = {zdE(z) and e > 0, show that ran E({z: |z| > e€}) € ran N. 
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8. Let @ be a linear manifold in #% and show that -@ has the property that /@ 
contains no closed infinite-dimensional subspaces if and only if whenever A € 
8(#) and ran A CM, then A is compact. 


9. Show that the extreme points of {A € @(#): 0 < A < 1} are the projections. 


§5. Topologies on &( #) 


In this section some results on the SOT and WOT on &(#) are presented. 
These results are necessary for understanding some of the results that are to 
follow in later sections and also for a proper comprehension of a number of 
other subjects in mathematics. 

The first result appeared as Exercise 1.4. 


5.1. Proposition. [f L: B(#) — C is a linear functional, then the following 
statements are equivalent. 


(a) L is SOT continuous. 

(b) L is WOT continuous. 

(c) There are vectors g),...,8,,;,.--,h, in # such that L(A)= 
Li (Ag,,h,) for every A in B(H#). 

PrRooFr. Clearly (c) implies (b) and (b) implies (a). So assume (a). By 

(1V.3.1f) there are vectors g,,...,g, in # such that 


n n 1/2 
IL(A)| < ¥ ||Ag,l| < va | » V4 
k=1 k=1 


for every A in @(#). Replacing g, by vn g,, it may be assumed that 
1/2 


= p(A). 


n 


IL(A)l < | d lAgall? 


k=1 


Now p is a seminorm and p(A) = 0 implies L(A) = 0. Let #= cl{ Ag, ® 
Ag, ®--: @®Ag,: AGC B(H)}; 80 XH CHO --- OF (n times). Note that 
if Ag, ®--- GAg, =0, p(A)=0, and hence, L(A) =0. Thus F(Ag, 
® --- ®Ag,) = L(A) is a well-defined linear functional on a dense mani- 
fold in &. But 

|F( Ag, ® --- ®Ag,)| < p(A) = ||Ag, ® «++ @ Ag, |. 


So F can be extended to a bounded linear functional F, on # ”. Hence, 


there are vectors h,,...,h, in # such that 
Fi f, O28 Of) = (fy © Of hy O Ol, 
= e (fis h,)- 
k=1 


In particular, L(A) = F(Ag, ® --- ®Ag,) = LE_ (Ag, h,). a 
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5.2. Corollary. Jf @ is a convex subset of B(#), the WOT closure of @ 
equals the SOT closure of @. 


ProoF. Combine the preceding proposition with Corollary V.1.4. i 


When discussing the closure (WOT or SOT) of a convex set it is usually 
better to discuss the SOT. Shortly an “algebraic” characterization of the 
SOT closure of a subalgebra of @(#) will be given. But first recall 
(VIII.5.3) that if 1 <n < co, # denotes the direct sum of # with itself 
n times (NS, times if n = 00). If A € B(#H), A” is the operator on #7” 
defined by A‘”(h,,...,h,) = (Ahy,..., Ah,). If SC B(H), LM= 
{A‘: A © Ff}. It is rather interesting that the SOT closure of an algebra 
can be characterized using its lattice of invariant subspaces. 


5.3. Proposition. If < is a subalgebra of B(#) containing 1, then the SOT 
closure of & is 


5.4 {B € B(#): for every finiten, Lat wv” C Lat B’ }. 


PROOF. It is left as an exercise for the reader to show that if B © SOT — 
cl x, B belongs to the set (5.4). Now assume that B belongs to the set (5.4). 
Fix f., f5,---,f, in # and e > 0. It must be shown that there is an A in ¥ 
such that ||(4 — B)f,|| < «forl <k <n. 

Let W=V{(Af,,..., Af): A € WH}. Because ¥ is an algebra, WE 
Lat. 7”; hence #€ Lat BB”. Because 1 © Wa, (fi,...,f,) EH. Since 
{(Af,,..., 4f,); A € WM} is a dense manifold and (Bf,,..., Bf,) ].4, 
there is an A in # with e? > Li_,||\(A — B)f,||?; hence B © SOT — 
cl A. b3) 


5.5. Proposition. The closed unit ball of B( 4) is WOT compact. 


PROOF. The proof of this proposition follows along the lines of the proof of 
Alaoglu’s Theorem. For each h in ball # let X, = a copy of ball 7 with 
the weak topology. Put X = II{ X,: ||h|| < 1}. If A € ball @(%@) let 7( A) 
© X defined by 7(A), = Ah. Give X the product topology. Then tr: 
(ball #(#), WOT) > X is a continuous function and a homeomorphism 
onto its image (verify). Now show that 7(ball @()) is closed in X. From 
here it follows that ball @(3) is WOT compact. x 


EXERCISES 
1. Show that if B € SOT — cl .x#, then B belongs to the set defined in (5.4). 
2. Show that @, 1s SOT dense in &. 


3. If {A,} and {B,} are sequences in @(#) such that A, —~ A(WOT) and 
B, > B(SOT), then A,B, > AB(WOT). 
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4. With the notation of Exercise 3, show that if A, —- A(SOT), then A,B, > 
AB(SOT). 


5. Let S be the unilateral shift on /*(N) (II.2.10). Examine the sequences {.S* } and 
{ S**\ and their relation to Exercises 3 and 4. 


6. (Halmos.) Fix an orthonormal basis {e,: n > 1} for #. (a) Show that 0 © weak 
closure of {Vn e,: n> 1} (Halmos [1982], Solution 28). (b) Let {n;} be a net of 
integers such that V2 e,, > 0 weakly. Define 4; f= Vn; (fn den, for f in #. 
Show that A, > 0 (SOT) but { A?} does not converge to 0 (SOT). 


§6. Commuting Operators 


If SC BH), let S’ = {A © B(H): AS = SA for every S in S}. F’ 
is called the commutant of .. It is not difficult to see that Y’ is always an 
algebra. Similarly, Y” = (¥’)’ is called the double commutant of S. This 
process can continue, but (thankfully) Y’” =’ (Exercise 1). In some 
circumstances, S= Ff”, 

The problem of determining the commutant or double commutant of a 
single operator or a collection of operators leads to some exciting and 
interesting mathematics. The commutant is an algebraic object and the idea 
is to bring the force of analysis to bear in the characterization of this 
algebra. 

We begin by examining the commutant of a direct sum of operators. 
Recall that if #=0,0®4,0.--- and A, € @(#,) for n= 1, then 
A=A,@®A,@-:- defines a bounded operator on # if and only if 
sup,,||4,,|| < 00; in this case ||A|| = sup,||4,||. Also, each operator B on # 
has a matrix representation [B;,] where B;, € B( 4, 4). 


6.1. Proposition. (a) If A = A, ® A, @--- is a bounded operator on 
H= KH, @H,@--+ and B=([B,,| © B(#), then AB = BA if and only if 
B,,A, = A,B,, for alli, j. 

(b) If B=[B,,J € Bi H™), BA™ = AB if and only if B,,A = AB;, 
for alli, j. 


The proof of this proposition is an easy exercise in matrix manipulation 
and 1s left to the reader. 


6.2. Proposition. /f A € @(#) and 1<n< _o, then {A\™)” = (B\: 
Be {Ay} = ({A}"}™, 


PROOF. The second equality in the statement is a tautology and it is the first 
equality that forms the substance of the proposition. If B € {A}”, then the 
preceding proposition implies that B“&” © { A“ }”. Now let B € { A\™)”. 
To simplify the notation, assume n = 2. So B © {A @ A}”; let B = [B,,], 
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; 0 1 
B,, © B(#). Since ° 0 


B,, = B,, and B,, = 0. Similarly, the fact that : commutes with 


A @ A implies that B,, = 0. If C = B,, (= B), B= COC IETE {A} 
then 70 T€ {A @A}’, so B(T © T) =(T ® T)B. This shows that C © 
{A}", a 


6.3. Corollary. If PSC BH), (POW = (AY, 


© {A @ A}’, matrix multiplication shows that 


Say that a subspace .4 of # reduces a collection * of operators if it 
reduces each operator in #. By Proposition II.3.7, 4 reduces £” if and 
only if the projection of # onto 4 belongs to ’. This is important in the 
next theorem, due to von Neumann [1929]. 


6.4. The Double Commutant Theorem. Jf ./ is a C*-subalgebra of B(#) 
containing 1, then SOT — cl W= WOT —- cl W= 7”. 


ProoF. By Corollary 5.2, WOT — cl w= SOT — cl #. Also, since #” is 
SOT closed (Exercise 2) and Y%c ¥”, SOT — cl JC WH”. 

It remains to show that »&” C SOT — cl #. To do this Proposition 5.3 
will be used. 

Let BE WM”, n>1, and let We Lat wv. It must be shown that 
BUC AN. Because ¥ is a C*-algebra, so is '". So the fact that 
Me Lat” and A*™ € Vv whenever A” € .o implies that 74 
reduces A‘) for each A in &. So if P is the projection of #'” onto .4, 
Pe {Wd But BE”; so by Corollary 6.3, BY € {v'"}”. Hence 
BP = PB™ and WeELatB”. 


6.5. Corollary. If < is a SOT closed C*-subalgebra of B(H) and A © 
B( HX) such that A(P#) C P# for every projection P in x’, then A © &. 


PROOF. This uses, in addition to the Double Commutant Theorem, Proposi- 
tion 4.8 as applied to /’. Indeed, ~’ is a SOT closed C*-algebra and 
hence it is the norm-closed linear span of its projections. So if A © B(H) 
and AP#c PH for every projection P in &’, then AQ] — P)#’C (1 - 
P)H for every projection P in &’. Thus P# reduces A and, hence, 
AP = PA. By (4.8), A €¥#” = &%. = 


6.6. Theorem. If (X,2,) is a o-finite measure space and 6 © L™(), 
define M, on L*(p) by M,f = of. If 7%, = {M,: ¢ © L*(p)}, then f= 
Sh = ABN. 
PROOF. It is easy to see that if =e’, then W= eV”. Since H, CAH’, it 
suffices to show that A! CL,. So fix A in Ah; it must be shown that 
A = M, for some ¢ in L®(p). 

Case 1: p(X) < co. Here 1 © L*(u); put ¢ = A(1). Thus ¢ € L?(m). If 
ye L*(p), then pe L*(p) and A(y)= AM] = M,Al = Myo = Oy. 
Also, ||O¥||2 = ||Avll2 < IAT Pl2- 
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Let A, = {x © X: |o(x)| =n}. Putting Y = x, in the preceding argu- 
ment gives 


Ale (4,,) = AIP? = llovll? = J |? dp > n*p(A,,). 


So if w(A,) # 0, ||Al] =n. Since A is bounded, p(A4,) = 0 for some n; 
equivalently, ¢ © L*(p). But A = M, on L*(p) and L®(p) is dense in 
L’(u), so A = M, on L*(p). 

Case 2: (X) = 00. If w(A) < 00, let L?(wJA) = {fe L*(n): f= 0 off 
A\. For f in L?(p|A), Af = Ax,f=x Af € L?(plA). Let A, = the 
restriction of A to L*(p|A). By Case 1, there is a , in L®(p|A) such that 
A, = M,,. Now if w(4)) < 00 and w(4,) < 00, dy |A, 9 A, = $4 |4, 9 4, 
(Exercise). 

Write X = US_,A,, where A, € Q and p(4,) < oo. From the argument 
above, if $(x) = ¢, (x) when x €A,, $ is a well-defined measurable 
function on X. Now |[$gll,. = ||Mz,l| (IL-5) = ||Aqll < |All; hence ||9Il,, 
< ||A||. It is easy to check that A = Mg. g 


The next result will enable us to solve a number of problems concerning 
normal operators. It can be considered as a result that removes a technical- 
ity, but it is much more than that. 


6.7. The Fuglede—Putnam Theorem. Jf N and M are normal operators on # 
and X, and B: X —> # is an operator such that NB = BM, then N*B = 
BM*. 


Proor. Note that it follows from the hypothesis that N“B = BM* for all 
k > 0. So if p(z) is a polynomial, p(N)B = Bp(M). Since for a fixed z in 
C, exp(izN ) and exp(izM) are limits of polynomials in N and M, respec- 
tively, it follows that exp(izN )B = Bexp(izM) for all z in C. Equivalently, 
B =e "%Be'*™. Because exp(X + Y) = (exp X)(expY) when X and Y 
commute, the fact that N and M are normal implies that 


f(z) = e ZN" BoizM™ 


7a NR. oes =) * 
es ez e (ZN Bei?MoizM 


=, kas 5 * 
=e i(zN +2N) Boi(ZM+zM ye 


But for every z in C, zN* + ZN and zM* + ZM are hermitian operators. 
Hence exp[—i(zN* + zN)] and exp[i(zM* + zM)] are unitary (Exercise 
2.14). Therefore ||f(z)|| < ||Bl|. But f: C > @(%, #) is an entire func- 
tion. By Liouville’s Theorem, f is constant. 

Thus, 0 = f(z) = —iN*e7@N"BE@™M™ + je—?@N"BM *e=™" Putting z = 
O gives 0 = —iN*B + iBM*, whence the theorem. a 


This theorem was originally proved in Fuglede [1950] under the assump- 
tion that N = M. As stated, the theorem was proved in Putnam [1951]. The 
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proof given here is due to Rosenblum [1958]. Another proof is in Radjavi 
and Rosenthal [1973]. Berberian [1959] observed that Putnam’s version can 
be derived from Fuglede’s original theorem by the following matrix trick. If 


then L is normal on #0 # and LA = AL. Hence L*4 = AL*, and this 
gives Putnam’s version. 


6.8. Corollary. Jf N = {zdE(z) and BN = NB, then BE(A) = E(A)B for 
every Borel set A. 


PROOF. If BN = NB, then BN* = N*B; the conclusion now follows by The 
Spectral Theorem. | 


The Fuglede—Putnam Theorem can be combined with some other results 
we have obtained to yield the following. 


6.9. Corollary. Jf «. is a compactly supported measure on C, then 
(NJ =, = {Mg @ = L*(n)}. 


ProoF. Clearly ~/, C {N,}’. If A © {N,}’, then Theorem 6.7 implies AN,* 
= N.*A. By an easy algebraic argument, AM, = M,A whenever ¢ Is a 
polynomial in z and Zz. By taking weak* limits of such polynomials, it 
follows that A € .&’. By Theorem 6.6 A € #,. a 


Putnam applied his generalization of Fuglede’s Theorem to show that 
similar normal operators must be unitarily equivalent. This has a formal 
generalization which is useful. 


6.10. Proposition. Let N, and N, be normal operators on #, and XH. If X: 
KH, — XH, is an operator such that XN, = N,X, then: 


(a) cl(ran X) reduces N,; 
(b) ker X reduces N;; 
(c) If M, = N,|(ker X) and M, = N,|cl(ran X), then M, = M). 


ProorF. (a) If f, € %,, No Xf, = XN,f, © ran X; so cl(ran X) is invariant 
for N,. By the Fuglede—Putnam Theorem, XN,* = N,*X, so cl(ran X) 1s 
invariant for N,*. 

(b) Exercise. 

(c) Since X(ker X)* C cl(ran X), part (c) will be proved if it can be 
shown that N, = N, when ker X = (0) and ran X is dense. So make these 
assumptions and consider the polar decomposition of X, X = UA. Because 
ker X = (0) and ran X is dense, A is a positive operator on #%, and U: 
KH, — AH, 18 an isomorphism. Now X*N,* = N,*X*, so X*N, = N,X*. A 
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calculation shows that A* = X*X € {N,}’, so A € {N,}’. (Why?) Hence 
N,UA = N,X = UAN, = UN,A; that is, N,U = UN, on the range of A. But 
ker A = (0), so ran A is dense in #,. Therefore N,U = UN,, or N, = 
UN,U-'. 


6.11. Corollary. Two similar normal operators are unitarily equivalent. 


The corollary appears in Putnam [1951], while Proposition 6.10 first 
appeared in Douglas [1969]. 


EXERCISES 
1. If SC B(H), show that PW’ =f”. 
2. If SC B(H), show that ¥’ is always a SOT closed subalgebra of @(#%). 
3. Prove Proposition 6.1. 


4. Let # be a Hilbert space of dimension a and define S: #'©) — #™) by 
S(hy,hy,...) = (O,h,,h5, ...). S is called the unilateral shift of multiplicity a. 
(a) Show that A =[A;;] © {}’ if and only if A;,; = 0 for j >i and A,, = 
A;.1,;+1 for i > j. (b) Show that A = [A,,] © {/}” if and only if A;, = 0 for 
j>iand A;, = A;,,,;.; = a multiple of the identity for i > /. 


5. What is {N, ® N,}’? (N, ® N,}’? 


6. If is a subalgebra of 4(), show that is a maximal abelian subalgebra 
of @(#) if and only if = #’. 


7. Find a non-normal operator that is similar to a normal operator. (Hint: Try 
dim #= 2.) 


8. Let » be a compactly supported measure on C and let ¥% be a separable Hilbert 
space. A function f: C > & is a Borel function if f-'(G) is a Borel set when G 
is weakly open in %. Define L*(,.%#) to be the equivalence classes of Borel 
functions f:C > # such that f/||f(x)||? du(x) < 0. Define (f,g) = 
[(f(x), g(x)) du(x) for f and g in L?(p,.%#). (a) Show that L?(u,%) is a 
Hilbert space. Define N on L*(p, #) by (Nf )(z) = zf(z). (b) Show that N is a 
normal operator and o(N) = support p. Calculate N*. (c) Show that N = as 
where a = dim %. (d) Find {N}’. (Hint: Use 6.1.) (e) Find {N}”. 


9. Let # be separable with basis {e,,}. Let A be the diagonal operator on # 
given by Ae, =A, e,, where sup,|A,,| < oo. Determine {A}’ and {A}”. Give 


nvn? 


necessary and sufficient conditions on {A,,} such that {A}’ = {A}”. 


10. Let be a C*-subalgebra of #(#%) but do not assume that » contains the 
identity operator. Let # = V{ran A: A © W} and let P = the projection of # 
onto 4. Show that SOT — cl W=W7”P = Px”. 
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§7. Abelian von Neumann Algebras 


7.1. Definition. A von Neumann algebra & is a C*-subalgebra of B(#) 
such that A= ¥”. 


Note that if s is a von Neumann algebra, then 1 © ¥Y and w is SOT 
closed. Conversely, if 1 €.% and # is a SOT closed C*-subalgebra of 
B(H), then & is a von Neumann algebra by the Double Commutant 
Theorem. 


7.2. Examples. (a) @(#@) and C are von Neumann algebras. 

(b) If (X, Q, ) is a o-finite measure space, then Y%, = {M,: ¢ © L*()} 
C @(L?*(p)) is an abelian von Neumann algebra by Theorem 6.6. In fact, it 
is a maximal abelian von Neumann algebra. 

It will be shown in this section that .¥, is the only abelian von Neumann 
algebra up to a *-isomorphism. However, there are many others that are 
not unitarily equivalent to ¥,,. 

For 7%, C BH), j = 1, L, © LH, ® --- is used to denote the /” direct 
sum Of /,, %,,.... Thatis, 7% @Y%,8--- ={A, BA, 9° 1A CA, 
for j = 1 and sup,||A,|| < oo}. Note that 7, @<Y%,8--- CB(H, 8H, 
® ---) and ||4, ® A, ® ++ || = supillA|ll 


7.3. Proposition. (a) If °7,, %, ... are von Neumann algebras, then so is 
A, @L,®--- .(b) If L is a von Neumann algebra and 1 <n < o, then 
xf") is a von Neumann algebra. 


PROOF. Exercise. 
The proof of the next result is also an exercise. 


7.4. Proposition. Let ., be a von Neumann algebra on H#,, j = 1,2. If U: 
KH, > Hy, is an isomorphism such that Ud ,U~! = f,, then US {U* = £4. 


Now let (X, 2, u.) be a o-finite measure space and define p: 7, > AO) 
by p(T) = T © T. Then p is a *-isomorphism. However, ., and Ah) are 
not spatially isomorphic. That is, there is no Hilbert space isomorphism U: 
L*(p) > L?(p) ® L?(p) such that Us U~* = &°. Why? One way to see 
that no such U exists is to note that , has a cyclic vector (give an 
example). However, ./{° does not have a cyclic vector as shall be seen 
presently (Theorem 7.8). 


7.5. Definition. If Cc 4(#) and e, € #, then ey is a separating vector 
for »& if the only operator A in #& such that Ae, = 0 is the operator 
A = 0. 
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If (X, Q, w) is a o-finite measure space and f € L*(p) such that w({ x © X: 
f(x) = 0} = 0 (Why does such an f exist?), then f is a separating vector for 
, as well as a cyclic vector. If = 4(#), then no vector in # is 
separating for w while every nonzero vector is a cyclic vector. If “= C 
and dim #> 1, then » has no cyclic vectors but every nonzero vector is 
separating for #. 


7.6. Proposition. If e, is a cyclic vector for L%, then ey is a separating vector 
for L’. 


ProoF. If T € #’ and Te, = 0, then for every A in &W, TAe, = ATe, = 0. 
Since Vi¥Ve, = #,T=0. J 


7.7. Corollary. If < is an abelian subalgebra of B( 4), then every cyclic 
vector for & is a separating vector for &. 


PROOF. Because 7 is abelian, YC oH’. | 


Since #(#)’ = C, Proposition 7.6 explains some of the duality exhibited 
prior to (7.6). Also note that if (X, 0, ) is a finite-measure space, 1 © 0, 
0 @ 1, and1 @ 1 are all separating vectors for %/. Because 7, # (W)’, 
the next theorem says that oe has no cyclic vector. 

Although it is easy to see that conditions (a) and (b) in the next result are 
equivalent, irrespective of any assumption on #, the equivalence of the 
remaining parts to (a) and (b) is not true unless some additional assumption 
is made on # or #@ (see Exercise 5). We are content to assume that ¥ is 
separable. 


7.8. Theorem. Assume that # is separable and & is an abelian C *-subal- 
gebra of B(#). The following statements are equivalent. 


(a) xf is a maximal abelian von Neumann algebra. 

(b) S= AH’. 

(c) & has a cyclic vector and is SOT closed. 

(d) There is a compact metric space X, a positive Borel measure p with 
support X, and an isomorphism U: L?() > # such that UL,U oe GL, 


PROOF. The proof that (a) and (b) are equivalent is left as an exercise. 

(b) = (c): By Zorn’s Lemma and the separability of 0, there is a 
maximal sequence of unit vectors {e,} such that for n # m, cl[.#e,] 1 
cl[.e,,]. It follows from the maximality of {e,} that @= ®™ cl[#e,]. 

Let e, = U°_,e,/ V2". Since e, 1 e,, for n # m, |leo||? = L2~” = 1. Let 
P, = the projection of # onto # = cl[.We,]. Clearly # leaves #  in- 
variant and so, since W is a *-algebra, # reduces Y%. Thus P, € #’ =H 
and cl[.¥e,] 2 cll. P_e)] = cll ve, ] = %,. Therefore cl[.#%e,)] = # and e, 
is a cyclic vector for 7. 
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(c) = (d): Since # is separable, ball x is WOT metrizable and compact 
(1.3 and 5.5). By picking a countable WOT dense subset of ball.» and 
letting »/, be the C*-algebra generated by this countable dense subset, it 
follows that ., is a separable C *-algebra whose SOT closure is ./. Let X 
be the maximal ideal space of »~, and let p: C(X) ~ YH, CSC B(H) be 
the inverse of the Gelfand map. By Theorem 1.14 there is a spectral measure 
E defined on the Borel subsets of X such that p(u) = fudE for u in CCX). 
If ¢ € B(X) and {u,} is a net in C(X) such that fu;dv > {dv for every 
y in M(X), then p(u,)= fu;dE ~ {¢dE (WOT). Thus {/¢dE: ¢e€ 
B(X)} € H since # 1s SOT closed. 

Let e, be a cyclic vector for & and put p(A) = ||E(A)epl|* = 
(E(A)eo, eo). Thus ((/odE)eo,e9) = fodp for every @ in BCX). Con- 
sider B(X) as a linear manifold in L*() by identifying functions that agree 
ae. [p]. If 6 © BCX), then 


Joa) “= ((foae)’ | fodt eaves 


= flor ap. 


This says two things. First, if @ = 0 a.e. [uw], then ( {¢@dE)e, = 0. Hence U: 
B(X) > # defined by Ud = ({¢dE)e, is a well-defined map from the 
dense manifold B(X) in L?() into #. Second, U is an isometry. Since the 
domain and range of U are dense (Why?), U extends to an isomorphism U: 
L*(p) > #. 

If ¢& B(X) and YE L*(n), then UM,o = U(Vd) = (f¥gdE Jey = 
(fP dE) fodE Je, = (fy dE)U¢. Hence UM,U~* = {o¢dE and Ust,U~* G 
. On the other hand, Ux%,U~* is a SOT closed C*-subalgebra of #(#”) 
that contains UC(X)U~' = ,. (Why?) So U%,U~* = 2. 

Because .%, is separable, X is metrizable. 

(d) = (b): This is a consequence of Theorem 6.6 and Proposition 7.4. 

a 


7.9. Corollary. If is an abelian C*-subalgebra of B(H#) and XH is 
separable, then & has a separating vector. 


PRoor. By Zorn’s Lemma, is contained in a maximal abelian C *-alge- 
bra, &%,. It is easy to see that #W,, must be SOT closed, so ~%, 1s a 
maximal abelian von Neumann algebra. By the preceding theorem, there is 
a cyclic vector e, for Y,,. But (7.7) eo is separating for %,,, and hence for 
any subset of 7. a 


The preceding corollary may seem innocent, but it 1s, in fact, the basis for 
the next section. 
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EXERCISES 


l, 
2 


10. 


11. 


Prove Proposition 7.3. 


Prove Proposition 7.4. 


. Why are 4, and . not spatially isomorphic? 


. Show that if X is any compact metric space, there is a separable Hilbert space 


H and a *-monomorphism 7: C(X) ~ @(#). Find the spectral measure for 7. 


. Let © be an abelian C*-subalgebra of 4(3) such that /’ contains no 


uncountable collection of pairwise orthogonal projections. Show that the follow- 
ing statements are equivalent: (a) &Y is a maximal abelian von Neumann 
algebra; (b) =’; (c) Ww has a cyclic vector and is SOT closed; (d) there is a 
finite-measure space (X, 02,4) and an isomorphism U; L?(p) ~ # such that 
Uxt,U ee A 


. Let {P,} be a sequence of commuting projections in 4(#%) and put A = 


Lr 3° "(2P, — 1). Show that C*(A) is the C*-algebra generated by { P,}. (Do 
you see a connection between A and the Cantor—Lebesgue function?) 


. If # is an abelian von Neumann algebra on a separable Hilbert space #, show 


that there is a hermitian operator A such that = equals the smallest von 
Neumann algebra containing A. (Hint: Let { P,} be a countable WOT dense 
subset of the set of projections in & and use Exercise 6. This proof is due to 
Rickart [1960], pp. 293-294.) 


. If X 1s a compact space, show that C(X) is generated as a C*-algebra by its 


characteristic functions if and only if X is totally disconnected. If A is as in 
Exercise 6, show that o(A) is totally disconnected. 


. If X and Y are compact spaces and t: C(X) — C(Y) is a homomorphism with 


T(1) = 1, show that there is a continuous function ¢: Y > X such that 7(u) = 
uo for every u in C(X). Show that 7 is injective if and only if ¢ is surjective, 
and, in this case, 7 is an isometry. Show that 7 is surjective if and only if ¢ is 
injective. 


Let X and Z be compact spaces, Y= X X Z, and let ¢: Y— X be the 
projection onto the first coordinate. Define 7: C(X) > C(Y) by t(u) = ucg. 
Describe the range of rf. 


Adopt the notation of Exercise 9. Define an equivalence relation ~ on Y by 
saying y, ~ y> if and only if $(y,) = $(),). Let g: Y > Y/~ be the natural 
map and gy: C(Y/~) — C(Y) the induced homomorphism. Show that there is 
a *-epimorphism p: C(X) —~ C(Y/~) such that the diagram 


C(x) — > C(Y) 


PN 7 q* 
C(Y/~) 


commutes. Find the corresponding injection Y/~ — X. 
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12. If X is any compact metric space, show that there is a totally disconnected 
compact metric space Y and a continuous surjection @: Y ~ X. (Hint: Start by 
embedding C(X) into #(#) and use Exercises 7 and 8.) 


13. Show that every totally disconnected compact metric space is the continuous 
image of the Cantor ternary set. (Do this directly; do not try to use C*-algebras.) 
Combine this with Exercise 12 to get that every compact metric space is the 
continuous image of the Cantor set. 


14. If . is an abelian von Neumann algebra and X is its maximal ideal space, then 
X is a Stonean space; that is, if U is open in X, then clU is openin X, 


15. This exercise assumes Exercise 2.21 where it was proved that @* = &. When 
referring to the weak* topology on # = 4(#), we mean the topology # has as 
the Banach dual of @,. (a) Show that on bounded subsets of # the weak* 
topology = WOT. (b) Show that a C*-subalgebra of @(#) is a von Neumann 
algebra if and only if & is weak* closed. (c) Show that WOT and the weak* 
topology agree on abelian von Neumann algebras. (d) Give an example of a 
weak* closed subspace of #@ that is not WOT closed. 


§8. The Functional Calculus for Normal Operators: 
The Conclusion of the Saga 


In this section it will always be assumed that 
all Hilbert spaces are separable. 


Indeed, this assumption will remain in force for the rest of the chapter. This 
assumption is necessary for the validity of some of the results and minimizes 
the technical details in others. 

If N is a normal operator on #, let W*(N) be the von Neumann 
algebra generated by N. That is, W*(N) is the intersection of all of the von 
Neumann algebras containing N. Hence W*(N) is the WOT closure of 
{p(N, N*): p(z, Z) 1s a polynomial in z and 2}. 


8.1. Proposition. Jf N is a normal operator, then W*(N)= {N}" D 
{O(N ): 6 & B(o(N))}. 


PROOF. The equality results from combining the Double Commutant Theo- 
rem and the Fuglede—Putnam Theorem. If ¢ € B(o(N)), N = {zdE(z), 
and T<€ {N}’, then T € {N, N*}’ by the Fuglede—Putnam Theorem and 
TE(A) = E(A)T for every Borel set A by the Spectral Theorem. Hence 
To(N) = O(N)JT since ¢(N) = {odE. | 


The purpose of this section is to prove that the containment in the 
preceding proposition is an equality. In fact, more will be proved. A 
measure » whose support is o( N) will be found such that @(N) is well 
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defined if ¢ © L™(p) and the map @ > ¢$(N) is a *-isomorphism of L®(p) 
onto W*(N). To find p, Corollary 7.9 (which requires the separability of 
H ) 1s used. 

By Corollary 7.9, W*(N), being an abelian von Neumann algebra, has a 
separating vector e,. Define a measure p, on o(N) by 


8.2 u(A) = (E(A)eo, eo) = IIE(A) ell 


8.3. Proposition. u(4) = 0 if and only if E(A) = 0. 


PrRoor. If p(4)=0, then E(A)ey = 0. But F(A) = x,(N) © W*(N). 
Since é@, is a separating vector, E(4) = 0. The reverse implication is clear. 
a 


8.4. Definition. A scalar-valued spectral measure for N is a positive mea- 
sure p, on o(N) such that w(4) = 0 if and only if E(A) = 0; that is, uw and 
E are mutually absolutely continuous. 


So Proposition 8.3 says that scalar-valued spectral measures exist. It will 
be shown (8.9) that every scalar-valued spectral measure is defined by (8.2) 
where e, is a separating vector for W*(N). In the process additional 
information is obtained about a normal operator and its functional calculus. 

Ifhe #, let wp, = E, , and let 4, = cl[W*(N)h]. Note that 7%, is the 
smallest reducing subspace for N that contains h. Let N, = N|#%,. Thus N, 
is a *-cyclic normal operator with *-cyclic vector h. The uniqueness of the 
spectral measure for a normal operator implies that the spectral measure for 
N, 18S E(A)|74,; that is, x ,(N,) = x,(V)|%4, = E(A)|%,. Thus Theorem 
3.4 implies there is a unique isomorphism U,: #%, > L?(,) such that 
U,h =1 and U,N,U,'f = zf for all f in L?(u,). The notation of this 
paragraph is used repeatedly in this section. 

The way to understand what is going on is to consider each N, as a 
localization of N. Since N,, is unitarily equivalent to M, on L?(u,) we can 
agree that we thoroughly understand the local behavior of N. Can we put 
together this local behavior of N to understand the global behavior of N? 
This is precisely what is done in §10. 

In the present section the objective is to show that if h is a separating 
vector for W*(N), then the functional calculus for N is completely de- 
termined by the functional calculus for N,. The sense in which this 
“determination” is made is the following. If A € W*(N), then the defini- 
tion of #, shows that A%, C #,. Since A* © W*(N), #, reduces each 
operator in W*(N); thus A|#@, is meaningful. It will be shown that the 
map A > A|#, is a *-isomorphism of W*(N) onto W*(N,) if h is a 
separating vector for W*(N). Since N, is *-cyclic, Theorem 6.6 and 
Corollary 6.9 show how to determine W *(N, ). 

We begin with a modest lemma. 
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8.5. Lemma. [fh € # and p,:; W*(N) > W*(N,) is defined by p,(A) = 
A|#0,, then p, is a *-epimorphism that is WOT-continuous. Moreover, if 
WY € B(o(N)), then p,(W(N)) = Y(N,) and if A © W*(N), then there is a 
g@ in B(o(N,)) such that p,(A) = oCN,). 


PRooF. First let us see that p, maps W*(N) into W*(N,). If p(z,Z) 1s a 
polynomial in z and Zz, then p,[ p(N, N*)] = p(N,, N,*) as an algebraic 
manipulation shows. If {p,;} is a net of such polynomials such that 
p,(N, N*) — A(WOT), then for f,g in #4, (p(N,N*)f,g) > (Af, 2); 
thus p,(N,, N,*) > p,(A)(WOT) and so p,(A) © W*(N,). It is left as an 
exercise for the reader to show that p, is a *-homomorphism. Also, the 
preceding argument can be used to show that p, is WOT continuous. 

If ~ & B(o(N)), there is a net { p,(z, Z)} of polynomials in z and z such 
that {/p,dv — fy dv for every v in M(o(N)). (Why?) Since o(N,) € o(N) 
(Why?), {p,dyn — {dn for every 7 in M(o(N,)). Therefore p,(N, N*) > 
Y(N\WOT) and p,(N,, N,*) > ¥(N, (WOT). But ,(p,(N, N*)) = 
P(N, N;*) and Pal PAN, N*)) > p,(¥(N )); hence p,( P(N )) = Y(N,,). 

Let U,: 3%, — L?(p,) be the isomorphism such that U,i = 1 and 
UN US Na, If Ae W*(N) and A, =9,(A), then A,N, = N,A,; 
thus U,A,U,*° € {N,,,}’. By Corollary 6. 9, there is a ¢ in B(o(N,)) such 
that U,A,U, ' = Mg. Tt follows (How?) that A, = $(N,). 

Finally, to show tat p;, 1S surjective note that if BE W*(N,), then (use 
the argument in the preceding paragraph) B=y(N,) for some y in 
B(o(N,)). Extend y to o(N) by letting ~ = 0 on o(N)\o(N,). Then 
V(N) © W*(N) and p,(¥(N))=¥(N,)= Bo 


8.6. Lemma. [fe € # such that p, is a scalar-valued spectral measure for N 
and if v is a positive measure on o(N) such that v « p.,, then there is anh in 
KH such that v = p,. 


PROOF. This proof is just an application of the Radon—Nikodym Theorem 
once certain identifications are made; namely, f = [dv/dp,]'/* © L*(n.), 
so put h = U;'f. Hence h € %. For any Borel set A, »(A) = fx, dv = 


Ixaffdu. = (Mf, f) = (US'M, f, USP) = (E(A)h,h) = 4,(4). Oo 
8.7. Lemma. W*(N)= {$(N): ¢ © B(o(N ))}. 


Proor. Let #%= {o(N): 6 © B(o(N))}. Hence # is a *-algebra and 
& < W*(N) by Proposition 8.1. Since N € © it suffices to prove that 7 1s 
WOT closed. Let {¢;} be a net in B(o(N)) such that ¢,(N) — A (WOT); 
so A € W*(N). By (8.5) $,(N,) ~ A|%,(WOT) for any h in #. Also, by 
Lemma 8.5, for every h in # there is a ¢, in B(C) such that A|%, = 
¢,(N,). Fix a separating vector e for W*(N); hence pw, is a scalar-valued 
spectral measure for N. 

If h € #, then the fact that $,(N,) > ¢,(N,)(WOT) implies ¢, > 9, 
weak* in L*”(p,). Also, ¢, > 6 weak* in L™(p,). But uw, < p, so that 
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du,/du. © L'(p.); hence for any Borel set A 
Ap 
Joan, = [oan du. > [oeduy. 
But also 
[oidu, = fondue 
A A 


So 0 = {,(o — ,) dp, for every Borel set A. Therefore $, = 9, a.e. [u,]. 
But if ge#,, then (6,(N,)8, 8) = ($,(N)8, 8) = fo,du, = Jo.du, 
since #, <p. Thus ($,(N,)8, 8) = (¢-(N,)8,8); that is, $,(N,) = 
,(N,). In particular, Ah = $,(N,)h = $,(N,)h = 6,.(N )h. Since h was 
arbitrary, A = $,(N). a 


8.8. Corollary. If p,:; W*(N) > W*(N,) is the *-epimorphism of Lemma 
8.5, then kerp, = {o(N): ¢ = 0 ae. [p,]}. 


8.9. Theorem. Jf N is a normal operator and e <= #, the following state- 
ments are equivalent. 


(a) e is a separating vector for W*(N). 

(b) p, is a scalar-valued spectral measure for N. 

(c) The map p,: W*(N) > W*(N.) defined in (8.5) is a *-isomorphism. 
(d) {6 © B(o(N)): (N) = 0} = (6 © B(o(N)): o = 0 ae. [n,]}. 


PROOF. (a) => (b): Proposition 8.3. 

(b) = (c): By Lemma 8.5, p, is a *-epimorphism. By Corollary 8.8, 
kerp, = {¢(N): ¢=0 ae. [p,]}. But if ¢ = 0 ae. [u,], (b) implies that 
@ = 0 off a set A such that E(A) = 0. Thus ¢(N) = f[,odE = 0. 

(c) = (d): Combine (c) with Corollary 8.8. 

(d) = (a): Suppose A € W*(N) and Ae = 0. By Lemma 8.7, there is a @ 
in B(o(N)) such that $(N) =A. Thus, 0 = ||Aell? = (A*Ae,e) = 
{\o|- du. Sod =O0ae.[u,]. By (d), 4=0. 


These results can now be combined to yield the final statement of the 
functional calculus for normal operators. 


8.10. The Functional Calculus for a Normal Operator. Jf N is a normal 
operator on the separable Hilbert space # and yp is a scalar-valued spectral 
measure for N, then there is a well-defined map p: L®(n) > W*(N) given 
by the formula p(¢) = ¢(N) such that 


(a) p is a *-isomorphism and an isometry; 
(b) p: (L™(p), weak*) — (W*(N), WOT) is a homeomorphism. 


PROOF. Let e be a separating vector such that p = p, [by (8.6) and (8.9)]. If 
¢ € B(o(N)) and ¢ = Oa. [p], then ¢(N) = 0 by (8.9d); so p(o) = o(N) 
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is a well-defined map. It is left to the reader to show that p is a *-homo- 
morphism. By Lemma 8.7, p is surjective. Also, if p(¢) = ¢(N) = 0, then 
@ = 0 ae. [wu] by (8.9d). Thus p is a *-isomorphism. By (VIIJI.4.8) p is an 
isometry. (A proof avoiding (VIII.4.8) is possible—it is left as an exercise.) 
This proves (a). 

Let {¢,} be a net in L®(p) and suppose that ¢,(N) > O(WOT). If 
fe L(p) and f= 0, fu<p=uz,. By Lemma 8.6 there is a vector h such 
that fu=p,. Thus /o,fdp = {¢,du, = (¢,(N)h,h) > 0. Thus ¢, > 0 
(weak*) in L®(y). This proves half of (b); the other half is left as an 
exercise. a 


8.11. The Spectral Mapping Theorem. Jf N is a normal operator on a 
separable space and p is a scalar-valued spectral measure for N and if 
@ € L™(p), then o(o(N)) = the p-essential range of 9. 


PRooF. Use (8.10) and the fact (2.6) that the p-essential range of ¢ is the 
spectrum of @ as an element of L®(). is 


8.12. Proposition. Let N, 1, ¢ be as in (8.11). If N = [zdE, then poo? is 
a scalar-valued spectral measure for 6(N) and E © $' is its spectral measure. 


PROOF. Exercise. 


EXERCISES 
1. What is a scalar spectral measure for a diagonalizable normal operator? 


2. Let N, and N, be normal operators with scalar spectral measures p, and p,. 
What is a scalar spectral measure for N, ® N,? 


3. Let {e, } be an orthonormal basis for # and put p(A) = £2 ,2 "||E(A)e, ||’. 
Show that p is a scalar spectral measure for N. 


4. Give an example of a normal operator on a nonseparable space which has no 
scalar-valued spectral measure. 


5. Prove that the map p in (8.10) is an isometry without using (VIII.4.8). 
6. Prove Proposition 8.12. 


7. Show that if » and vy are compactly supported measures on C, the following 
statements are equivalent: (a) N,@®WN, is *-cyclic; (b) W*(N,@N,)= 
W*(N,) ® W*(N,); (C) wv. 


8. If M and WN are normal operators with scalar spectral measures p and »?, 
respectively, show that the following are equivalent: (a) W*(M@®WN)= 
W*(M)®W*(N); (bt) {MON} = {MY @ {N}’; (c) there is no operator A 
such that MA = AN other than A = 0; (d) p 1 »v. 


9. If M and N are normal operators, show that C*(M @ N) = C*(M) ® C*(N) 
if and only if o(M) No(N) =0C. 
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10. Give an example of two normal operators M and N such that W*(M ® N) = 
W(M) ® W*(N) but C*(M @ N) # C*(M) @C*(N). In fact, find M and 
N such that W*(M @® N) splits, but o(M) = o(N). 


11. If U is the bilateral shift and V is any unitary operator, show that W*(U ® V) 
= W*(U) ® W*(V) if and only if V has a spectral measure that is singular to 
arc length on 0D. 


12. If & is an abelian von Neumann algebra on a separable space, show that there 
is a compactly supported measure » on R such that #W is *-isomorphic to 
L”(). (Hint: Use Exercise 7.7.) 


13. (This exercise assumes a knowledge of Exercise 2.21). Let N = {zdE(z) bea 
normal operator with scalar-valued spectral measure p and define a: 4,(#) > 
L'(p) by a(T)(A) = tr(TE(A)). Show that a is a surjective contraction. What is 
a*? [L'(w) is identified, via the Radon-Nikodym Theorem, with the set of 
complex-valued measures that are absolutely continuous with respect to p.] 


§9. Invariant Subspaces for Normal Operators 


Remember that we continue to assume that all Hilbert spaces are separable. 

Every normal operator on a Hilbert space of dimension of at least 2 has a 
nontrivial invariant subspace. This is an easy consequence of the Spectral 
Theorem. Indeed, if N = {zdE(z), E(A)# is a reducing subspace for 
every Borel set A. 

If A &€ B(H#), 4 is a linear subspace of #, and P is the orthogonal 
projection of # onto 4, then .@ reduces A if and only if P © { A}’. Also, 
M = Lat A (= the lattice of invariant subspaces for A) if and only if 
AP = PAP. Since the spectral projections of a normal operator belong to 
W *(N), they are even more than reducing. 


9.1. Definition. An operator A 1s reductive if every invariant subspace for 
A reduces A. Equivalently, A is reductive if and only if Lat A = Lat A*. 


Thus, every self-adjoint operator is reductive. Every normal operator on a 
finite-dimensional space is reductive. More generally, every normal compact 
operator is reductive (Ando [1963]). However, the bilateral shift is not 
reductive. Indeed, if U is the bilateral shift on /7(Z), #= { f € 1°(Z): 
f(n) = 0 if n < 0} © LatU, but #¢ LatU*. Wermer [1952] first studied 
reductive normal operators and characterized the reductive unitary oper- 
ators. A first step towards characterizing the reductive normal operators will 
be taken here. The final step has been taken but it will not be viewed in this 
book. The result is due to Sarason [1972]. Also see Conway [1981], § VII.5. 


9.2. Definition. If u is a compactly supported measure on C, P™(p) 
denotes the weak* closure of the polynomials in L®(,). 
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Because the support of pu 1s compact, every polynomial, when restricted to 
that support, belongs to L™(p). 

For any operator A, let W(A) denote the WOT closed subalgebra of 
B(H) generated by A; that is, W(A) is the WOT closure in @(.#%) of the 
polynomials in A. The next result is an immediate consequence of The 
Functional Calculus for Normal Operators. 


9.3. Theorem. Jf N is a normal operator and is a scalar-valued spectral 
measure for N, then the functional calculus, when restricted to P™(), is an 
isometric isomorphism p: P~(p) > W(N) and a weak*-WOT homeomor- 
phism. Also, p(z) = N. 


9.4. Definition. An operator A is reflexive if whenever B © @(%) and 
LatA C Lat B, then B © W(A). 


It is easy to see that if B © W(A), then Lat A C Lat B (Exercise). An 
operator is reflexive precisely when it has sufficiently many invariant sub- 
spaces to characterize W( A). For a survey of reflexive operators and some 
related topics, see Radjavi and Rosenthal [1973]. 


9.5. Theorem. (Sarason [1966].). Every normal operator is reflexive. 


PROOF. Suppose N is normal and Lat N C Lat A. If P is a projection in 
{N}’, then P# and (P3#/)~ € Lat N C Lat A, so AP = PA. By Corollary 
6.5, 4 © W*(N). Let pw be a scalar-valued spectral measure for N. By 
Theorem 8.10, there is a ¢ in L®(p) such that A = o(N). By Theorem 9.3, 
it must be shown that ¢@ € P™(). 

Now let’s focus our attention on a special case: Assume that MN 1s 
*-cyclic; thus N = N,. Suppose f € L'(w) and {fy dp = 0 for every p in 
P~(). If it can be shown that {f¢dp = 0, then the Hahn—Banach Theo- 
rem implies that @ © P%(p). This is the strategy we follow. Let f = gh for 
some g,h in L*(p). Put #W=V{z*g: k = 0}. Clearly “Ze Lat N, so 
M = Lat A = Lato(N) = LatM,. Hence ¢g€.4%. But 0= fz*fdu= 
{z*ghdu = (N*g,h) for all k > 0; hence h 1.4. Thus 0 = ($g,h) = 
foghdp = {ofdp, and ¢ © P™(p). 

Now we return to the general case. By Theorem 8.9 there is a separating 
vector e for W*(N) such that p(A) = ||E(A)el|?, where N = [zdE(z). Let 
t= V{N**Ne: k, j = 0}. Clearly # reduces N and N|X is *-cyclic. 
Hence “& reduces A and A|\¥= o(N|#). By the preceding paragraph 
oeP(n) ff 


An immediate consequence of the preceding theorem is the first step in 
the characterization of reductive normal operators. 
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9.6. Corollary. Jf N is a normal operator and yp is a scalar spectral measure 
for N, then N is reductive if and only if P®(w) = L®(p). 


Proor. If @€ Lat N, then @#€ Lat¢d(N) for every ¢ in P*(p). So if 
P“(nw) = L@(p), ze P~(p) and, hence, M€ LatN* whenever WE 
Lat N. 

Now suppose WN is reductive. This means that Lat N C Lat N*. By the 
preceding theorem, this implies N* € W(N); equivalently, z © P”(p). 
Since P™() is an algebra, every polynomial in z and Z belongs to P™”(p). 
By taking weak* limits this implies that P®(w) = L®(1). | 


The preceding corollary fails to be a good characterization of reductive 
normal operators since it only says that one difficult problem is equivalent 
to another. A way is needed to determine when P”(p) = L*(). This is 
what was done in Sarason [1972]. 

Are there any reductive operators that are not normal? This natural and 
seemingly innocent question has much more to it than meets the eye. Dyer, 
Pedersen, and Porcelli [1972] have shown that this question has an affirma- 
tive answer if and only if every operator on a Hilbert space has a nontrivial 
invariant subspace. 


EXERCISES 


1. (Ando [1963].) Use Corollary 9.6 to show that every compact normal operator is 
reductive. 


2. Determine all of the invariant subspaces of a compact normal operator. 


3. (Rosenthal [1968].) Show that every reductive compact operator is normal. 


§10. Multiplicity Theory for Normal Operators: 
A Complete Set of Unitary Invariants 


Throughout this section only separable Hilbert spaces are considered. 

When are two normal operators unitarily equivalent? The answer to this 
question must be given in the following way: to each normal operator we 
must attach a collection of objects such that two normal operators are 
unitarily equivalent if and only if the two collections are equal (or equiv- 
alent). Furthermore, it should be easier to verify that these collections are 
equivalent than to verify that the normal operators are equivalent. This is 
contained in the following result due to Hellinger [1907]. Note that it 
generalizes Theorem 3.6. 
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10.1. Theorem. (a) Jf N is a normal operator, then there is a sequence of 
measures { t,\ ( possibly finite) on C such that p,., <« p, for all n and 


10.2 N2=N, @N,, ®--- 


(b) If N and {,,} are asin (a) andM=N, ®N, ® ---, where v,., < », 
for all n, then N = M if and only if [p,,| = [v,] for all n. 


The proof of this theorem requires several lemmas. Before beginning, we 
will examine a couple of false starts for a proof. This will cause us to arrive 
at the correct strategy for a proof and show us the necessity for some of the 
lemmas. 

Let N = fzdE(z). If e€ # and 4 =cl[W*(N )e], then NIX is a 
*-cyclic normal operator. An application of Zorn’s Lemma and the sep- 
arability of # produces a maximal sequence {e,} in # such that 
KH, 1 4, . By the maximality of e,, #= 0,#% .If N, = Nj, N, =N,. 
where Ln (A) = = ||E(A)e,||*; thus N = ®,N,. Theinouble here 4 that ee 
is not necessarily absolutely continuous with respect to p,,. Just using Zorn’s 
Lemma to produce the sequence {p,,} eliminates any possibility of having 
{p,,} canonical and producing the unitary invariant desired for normal 
operators. Let’s try again. 

Note that if u,,, <u, for all » in (10.2), then uw, < p, for all n. This 
in turn implies that p, is a scalar-valued spectral measure for N. Using 
Lemma 8.6 we are thus led to choose p, as follows. Let e, be a separating 
vector for W*(N); this exists by Corollary 7.9 and the separability of #. 
Put p,(A) = ||JE(A)e,||*. If 2, = cl[W*(N )e,], then NIH, = N,,- Let 
N, = N|3@,~; so N, is normal. A pair of easy exercises shows that the 
spectral measure E, for N, is given by E,(4) = E(A)|#7,~> and W*(N,) = 
W*(N)|A> (= {A€ BA): A=T|H,° for some T in W*(N)}). 
Let e, be a separating vector for W*(N,) and put p,(A) = ||E,(A)e,||?. By 
the easy exercises above, p.,(A) = ||E(A)e,||’, so that wu, < p,, and % = 
cll[W *(N )e,] = cll[W*(N,)e,] < %” . Also, NIH, = N,. 

aap tess in this way produces a sequence of vectors {e, } such that if 

= cl[W*(N)e,] and w,(A) = ||E(A)e,||?, then @ 1%, for n#m, 
i 41 <u, and N|#, = N,. The difficulty here is that # is not neces- 
sarily equal to ®, #%, so ‘that N and © ,N,, cannot be proved to be 
unitarily equivalent. (Actually, N and ® WN, are unitarily equivalent, but 
to show this we need the force of Theorem 10.1. See Exercise 2.) The 
following provides us with a look at an example to see what can go wrong. 


10.3. Example. For n > 1 let u, = Lebesgue measure on [0,1 + 2° "] and 
let w,, = Lebesgue measure on [0,1]. Put N= @7,N, ON, . If the 
process of the preceding paragraph is followed, it might Ge that vectors {e,} 
that are chosen are the vectors with a 1 in the L*(,) coordinate and zeros 
elsewhere. Thus the spaces {3%} are precisely the spaces { L*(p,,)} and 
[BPH] ~ = L*(h..). 
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(Nevertheless, N = @YN, . Indeed, let », = w,|[1,1 + 2° "]; sou, = B, 
+ py, and p,, »,. Thus N, = N, ® N, . Therefore 


N= ®FN, ON, 


le 


= ON, ONION. 
= DN, ON 
= OPN, .) 


After an examination of the statement of Theorem 10.1, it becomes clear 
that some procedure like the one outlined in the paragraph preceding 
Example 10.3 should be used. It only becomes necessary to modify this 
procedure so that the vectors {e,} can be chosen in such a way that 
H= OFH,. For example, let, as above, e, be a separating vector for 
W*(N) and let { f, } be an orthonormal basis for # such that f, = e,. We 


now want to choose the vectors {e,} such that {fi,...,.f,}CH 
® --- @#,. In this way we will meet success. The vital link here is the next 
result. 


10.4. Proposition. Jf N is a normal operator on # and e € #, then there is 
a separating vector e, for W*(N) such that e € cl[W*(N )ep]. 


PrRooFr. Let fp) be any separating vector for W*(N), let E be the spectral 
measure for N, let w(A) = ||/E(A)f||’, and put Y= cl[W*(N)f)]. Write 
e=g,+h,, where 2g, ©¥Yandh,€G-. 

Let (A) = ||E(A)h,||* and let A= cl[W*(N)h,]. Hence y « p, N is 
reduced by both ¥ and Y, and #1 Y. Moreover, N|Y = N, and N| Y= N,. 
Now the fact that » < p implies that there is a Borel set A such that 
[n] = [p|4]. (Why?) Hence NIY= N, if v = plA (Theorem 3.6). Let U: 
G@ L— L*(p) ® L*(v) be an isomorphism such that U(N|Y¥ ® Z)U~! = 
N,@®N,. Since e=g, +h, Ee GOP, let Ve=g@h. Because h, is a 
* -cyclic vector for N|IY, h(z) # 0 ae. [pv]. 

This reduces the proof of this proposition to proving the next lemma. 


10.5. Lemma. Let p be a compactly supported measure on C, A a Borel 
subset of the support of w, and put v = p|A. IfN = N, ® N, on L*(p) © L*(v) 
and g ®h € L*(u) ® L?(v) such that h(z) # 0 a.e. [v], then there is an f in 
L*(u) such that f ®h is a separating vector for W*(N) and g®he 
cl[W*(N \(f ® h)). 


ProoFr. Define f(z) = g(z) for z in A and f(z)=1 for z not in A. Put 
KH= ilW*(N)(f 6 h)] = cl{ of ® oh: @ € L™(p)} since p is a scalar-val- 
ued spectral measure for N. If A’ = the complement of A, then note that 
oxy ®0= ox,4(f Oh) € # forall d in L®(p,). Hence L7(p|d’) 8 0 C H#. 
This implies that (1 — g)x,y ®@0EH,sogeh=feh—(1-—g)x,e0 
cH. 
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On the other hand, if ¢ € L*(u) and 0 = $f © oh, then of = oh = 0 
a.e. [uw]. Since h(z) #0 ae. [v], $(z) = 0 ae. [uw] on A. But for z in A’, 
f(z) = 1; hence $(z) = 0 ae. [p] on A’. Thus, f © h/ is a separating vector 
for W*(N). iz 


PROOF OF THEOREM 1.10 (a): Let e, be a separating vector for W*(N) and 
let {f,} be an orthonormal basis for # such that f, =e,. Put #, = 
cW*(N)e,], w,(A) = ||E(A)e,||°, and N, = Nit. Let f’ = the or- 
thogonal projection of f, onto #,* . By Proposition 10.4 there is a separat- 
ing vector e, for W*(N,) such that f,’ © cl[W *(N,)e,] = A. Note that 
H,, = AW*(N)e,]and{ fi, 5} CH © #&. Put p,(A) = ||E(A)e,||?. Now 
continue by induction. = 


Now for part (b) of Theorem 10.1. If [u,,] = [v,] (the notation is that of 
Theorem 10.1) for every n, then N, = N, by Theorem 3.6. Therefore 
N = M. Thus it is the converse that causes difficulties. So assume that 
N=M.IfM & @(%#), U: #- & is an isomorphism such that UNU~! = 
M, and e, is a separating vector for W*(N), then Ue, = f, 1s easily seen to 
be a separating vector for W*(M). Since p, and v, are scalar-valued 
spectral measures for N and M, respectively, it follows that [u,] = [v,]; thus 
N,, = N,, by Theorem 3.6. However, here is the difficulty—the isomor- 
phism that shows that N,, =N,, may not be oS i U; that is, if 
H= 8 H,, H= OFX, where NH, = N, and M4, , then N|#, 
= M Lt, 1 but we do not know that U#, = x, Thus we want ic argue that 
because N = M and N|#, = M|%, then N|7,* = M|#~ . In this way we 


can prove (10.1b) by induction. This step is justified by the following. 


10.6. Proposition. Jf N, A, and B are normal operators, N is *-cyclic, and 
N@A=N OB, then A = B. 


Proor. Let N € &(%#), A € B(H#,), BE B(H#H,), and let U: #8 H, > 
HX ® XH, be an isomorphism such that U(N @ A)U~' = N ® B. Now Ucan 
be written as a 2 X 2 matrix, 


where U,,;: # > #, Ui): 3, > XH, Uy: H> Hy, Uy: Hy > Hy. Ex- 
pressing N @ A and N @ B as 


N 0 N 0O 
| Alo | 0 4 
respectively, the equation U(N ® A) = (N ® B)U becomes 
; U,,N U,,A BU,, BU, | 
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Similarly, U(N @ A)* =(N @ B)*U becomes 
U,,N* U,,A* 
U,,N* U,,A* 


N*U,, N*U,> 
B*U,, B*Uy 


10.7* = 


Parts of the preceding equations will be referred to as (10.7);, and (10.7)*, 
i,j = 1,2. 

An examination of the equations U*U = 1 and UU* = 1, written in 
matrix form, yields the equations 


(a) UU ,, + UZU,, = 0 (on KH) 
10.8 (b) U*U* + U,USZ=1 (on X#) 
(c) U,,U;5 + U5,U;5 = 0 (on KH). 


Now equation (10.7), and Proposition 6.10 imply that (ker U,,) ~ reduces 
A, cl(ranU,,) = (kerU,4)~ reduces B, and 


10.9 A|(ker U,,)~ = B\(ker Us): 


What about A|kerU,, and B\ker U,4? If they are unitarily equivalent, then 
A = B and we are done. If h € kerU,, C &,, then 


Uy, U2 }} 0 z= Ui,h 
Since U is an isometry, it follows that U,;, maps ker U,, isometrically onto a 
closed subspace of 4%. Put .4, = U,,(kerU,,). Equations (10.7),, and 
(10.7), and the fact that kerU,, reduces A imply that 4, reduces N. 


Thus, the restriction of U,, to ker A,, is the required isomorphism to show 
that 


10.10 Alker U,, = NM. 


Similarly, U;* maps kerU,4 = (ranU,,)~ isometrically onto “4, = 
U,i (ker U3), @, reduces N, and 


10.11 B\ker U4 = NL). 


Note that if @%, =.@,, then (10.9), (10.10), and (10.11) show that A = B. 
Could it be that “4, and 4, are equal? 

If h € ker U,,, then (10.8a) implies that U;4U,,h = — U,*U,,h = 0. Hence 
M, = U,»,(ker U,,) € ker U,*. On the other hand, if f © ker U,*, then (10.8b) 
implies f= (UU, + U,U3)f= U,UZf. But by (10.8c), UU 3 f= 
— U,,U\4 f = 0, so U;3 f © ker U,,. Hence f © U,,(ker U,,). Thus, 


M, = ker U*. 
Similarly, 
M, = ker U,,. 
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Until this point we have not used the fact that N is *-cyclic. Equation 
(10.7),; implies that U,, © {N }”. By Corollary 6.9 this implies that U,, is 
normal. Hence, ker U,* = ker U,,, or 4, =. . | 


If the hypothesis in the preceding proposition that N is ¥*-cyclic 1s 
deleted, the conclusion is no longer valid. For example, let N and A be the 
identities on separable infinite-dimensional spaces and let B be the identity 
on a finite-dimensional space. Then N © A = N @ B, but 4 and B are not 
equivalent. However, the requirement that N be *-cyclic can be replaced by 
another, even when N, A, and B are not assumed to be normal. For the 
details see Kadison and Singer [1957]. 

The proof of Theorem 10.1(b) is now a straightforward argument as 
outlined before the statement of Proposition 10.6. The details are left to the 
reader. 

If w and pv are measures and v < p, then there is a Borel set A such that 
[v] = [p|A]. Using this fact, Theorem 10.1 can be restated as follows. 


10.12. Corollary. (a) If N is a normal operator with scalar spectral measure 
u, then there is a decreasing sequence {A,,} of Borel subsets of o(N) such 
that A, = o(N) and 


N=N,ON, 


wa, © Nya, ® -°° 


(b) If M is another normal operator with scalar spectral measure v and if 
{x ,} is a decreasing sequence of Borel subsets of o(M) such that M = N, ® 
Nis, ® Nis, --, then N=M if and only if () [pb] =[»] and (un) 


w(A,\2,,) = 0 = w(2,\4,) for all n. 


10.13. Example. Let »» be Lebesgue measure on [0,1] and let p, be 
Lebesgue measure on oe + 1,1/n] for n > 1. (Sop = Ly,.) Let N= N, 


® N® @ N© @ --- . The direct sum decomposition of N that appears a 
Corollary 10. 12 is obtained by letting 4, = [0,1/n], n 21. Then N=N, 
® Nia, @ Nia; @--: 


What does Theorem 10.1 say for normal operators on a finite-dimensional 
space? If dim #< oo, there is an orthonormal basis {e,,} for # consisting 
of eigenvectors for N. Observe that N is cyclic if and only if each eigenvalue 
has multiplicity 1. So each summand that appears in (10.2) must operate on 
a subspace of # that contains only one basis element e,, per eigenvalue. 
Moreover, since 4, is a scalar spectral measure for N, it must be that the 
first summand in (10.2) contains one basis element for each eigenvalue for 
N. Thus, if o(N) = {A,,A5,...,A,}, where A, # A, for i #/, then (10.2) 
becomes 


10.14 N=D,®D,®--- @D,, 


m 


where D, = diag(A,,A,,...,A,) and, for k > 2, D, is a diagonalizable 
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operator whose diagonal consists of one, and only one, of each of the 
eigenvalues of N having multiplicity at least k. 

There is another decomposition for normal operators that furnishes a 
complete set of unitary invariants and has a connection with the concept of 
multiplicity. For normal operators on a finite-dimensional space, this de- 
composition takes on the following form. 

Let A, = the eigenvalues of N having multiplicity k. So for A in A,, 
dimker(N — A) =k. If A, = (A: 1 <j < m,}, let N, be the diagonaliz- 
able operator on a km, dimensional space whose diagonal contains each 
\) repeated A times. So N=N,®N,@®--- ON, if o(N)=A, 
U-+- UA,. Now o(N,) = A, and each eigenvalue of N, has multiplicity 
k. Thus N, = At), where A, is a diagonalizable operator on an m, 
dimensional space with 0(A,) = A,. Thus 


10.15 N=A,@AP@--- @ A), 


and 0(A;) 1 o(A,) =O for i #7. 
Now the big advantage of the decomposition (10.15) is that it permits a 
discussion of { N }’. Because the spectra of the operators A, are disjoint, 


{N}"= {NM} @{N,}'@ --- OLN }". 


(Why?) If 36 = ker(N — AW), then dim #4“ = k and ©", = the 
domain of N,. Since AY?) # A for i # j, 


Ney = BH) a eee 0 B( 0"), 


and each #(#,‘")) is isomorphic to the k X k matrices. 

The decomposition of an arbitrary normal operator that is analogous to 
decomposition (10.15) for finite-dimensional normal operators is contained 
in the next result. The corresponding discussion of the commutant will 
follow this theorem. 


10.16. Theorem. Jf N is a normal operator, then there are mutually singular 
MEASUFES [LL ,,5 [15 bho, -.. (some of which may be zero) such that 


= N(%) aq... 
N=N° GN, ON ® , 


If M is another normal operator with corresponding measures v,,,V1,V>,.-., 
then N = M if and only if [w,] = [»,] for 1 <n < oo. 


PROOF. Let p be a scalar spectral measure for N and let {4,} be the 
sequence of Borel subsets of o( NV) obtained in Corollary 10.12. Put 2, = 
M_,4, and 2, =A,\A,., for 1 <n < oo; let wp, = pld,, 1 <n < 00. 
Put », = wlA,, 1 <n < 0. NowA, = 3, U (A,\ Anaa) U (Ane \Ana2) 
Ure = 2 UL, UZ,,.,U-:: . Hence » =p tue, t+pap toc: 
and the measures L,,H,,,-¢y,+ p> --- are pairwise singular. Hence IN», = Ni. 
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®@®N, ®N,  --+ . Combining this with Corollary 10.12 gives 
N=N,@N, @N, @--- 


=(N, @N, ON, ©---)O(N, ON ON, @--:) 
O(N, ON, ON, @---)@-- 


~ N(~H) (2) (3) ee 
_ Nu. © Nu ® ahs © Nu © 
The proof of the uniqueness part of the theorem is left to the reader. & 


Note that the form of the normal operator presented in Example 10.13 is 
the form of the operator given in the conclusion of the preceding theorem. 

Now to discuss { N }’. Fix a compactly supported measure p on C and let 
H# be an n-dimensional Hilbert space, 1 < n < oo. Define a function f: 
C — # to be a Borel function if z > (f(z), g) is a Borel function for 
each g in #,. If f: C > #%, is a Borel function and {e,} is an orthonormal 
basis for H, then WADI = LA f(z),e), so z > |If(z)|I* is a Borel 
function. Ler L*(; %,) be the space of all Borel functions f: C >- #, 
such that ||/|* = fif(2)I2 du(z) < oo, where two functions agreeing a.e. [p] 
are identified. If f and g © L7(u; %), (f,g) = (f(z), g(z)) du(z) de- 
fines an inner product on L*(u; %). It is not difficult to show that 
L*(p; 3%.) is a Hilbert space. 


10.17. Proposition. If N is multiplication by z on L?(p; %,), then N = Ne 


ProoF. Let {e,;: 1 <j < 1} be an orthonormal basis for #, and define U: 


L>(u; #2) > Lp) by Uf = (Cf()se1)s(f().e2)s «--). Then U is an 
isomorphism and UNU~' = N\””. The details are left to the reader. 


Combining the preceding proposition with Proposition 6.1(b), we can find 
PE namely, {N‘”}’ = all matrices (7,,) on B(L*(u)‘”) such that 
, © {N,} for all i, 7. By Corollary 6.9, (N; rm) yr = = all matrices (M,, ) that 
see to BL), such that $,, <= L*(u). Now the idea is to use 
Proposition 10.17 to bring this back to , BL p.; %)) and describe { N }’. 

A function ¢: C > &(#,) is defined to be a Borel function if for each f 
and g in #,, z > ($(z)f,g) is a Borel function. If { f;} is a countable 
dense subset of the unit ball of 7%, ||@(z)|| = sup{|(o(2) fi, fri ls ia < 
00}, SO z + ||(z)|| is a Borel function. Let L*(u; B(%,)) be the equivalent 
classes of bounded Borel functions from C into #(#%,) furnished with the 
u-essential supremum norm. 

If p= L(y; B(#H)) and fe L*(p; #), let f(z) = Uf(z)e;, where 
{e,} is an orthonormal basis for #, and f(z) = (f(z), e;), so Lif (2)? = 
f(z U2. Thus $(z)f(z) = Ef (z)6(ze,. So for any e in #, 
($(Z)f(Z), e) = LF (Zz) o(zZ)e;,e) is a Borel function. It is easy to check 
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that of © L*(p; #,) and |lofll < Ildllolifl. Let My: L*(m; %) > 
L*(; 36.) be defined by M,f = ¢f. Combined with the preceding remarks, 
the following result can be shown to hold. (The proof is left to the reader.) 


10.18. Proposition. If N is multiplication by z on L*(p; %,), then 
{N}'= {M,: @ © L*(u; B(H,))}. 
Also, ||Mg\| = ||¢||... for every > in L®(y; B( #,)). 


The next lemma is a consequence of Proposition 6.10 and the fact that 
unitarily equivalent normal operators have mutually absolutely continuous 
scalar-valued spectral measures. 


10.19. Lemma. Jf N, and N, are normal operators with mutually singular 
scalar spectral measures and XN, = N,X, then X = 0. 


Using the observation made prior to Corollary 6.8, the preceding lemma 
implies that {N, ® N,}’ = {N,}’ ® {N,}’ whenever N, and N, are as in 
the lemma. 

The next theorem of this section can be proved by piecing together 
Theorem 10.16 and the remaining results of this section. The details are left 
to the reader. 


10.20. Theorem. Jf N is a normal operator on #, there are mutually 
singular measures | ,,, 1, ky, ... and an isomorphism 


U: H- Bg ee H,,) ® L*( ty) ® L(y; Hy) ® --- 
such that 
UNU''=N,@®N,@N,0--- 
where N,, = multiplication by z on L*(p,; %,). Also, 
{N. © N, @ N,® ++ }'= L°(p,,; B(H,,)) © L?(m,) 
®L* (yu; B(H4)) @---. 


Using the notation of the preceding theorem, if is a scalar-valued 
spectral measure for N, then there are pairwise disjoint Borel sets A,,, A,, ... 
such that [u,,] = [u|4,,]. Define a function m,: C — {0,1, ... 00} by letting 
My = ©Xa + Xa, + 2Xa, + °°: . AS it stands the definition of my de- 
pends on the choice of the sets {A,,} as well as N. However, any two choices 
of the sets {4,} differ from one another by sets of u-measure zero. The 
function m,, 1s called the multiplicity function for N. Note that my, is a 
Borel function. 

If m: C — {0,0,1,2,...} is a Borel function and p is a compactly 
supported measure such that p({z: m(z) = 0}) = 0, let A, = {z: m(z) = 
n},n=00,1,2,....1fN,=M,, then N= NOON, ONO --- isa 
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normal operator whose spectral measure is p and whose multiplicity func- 
tion agrees with m a.e. [p]. 


10.21. Theorem. Two normal operators are unitarily equivalent if and only if 
they have the same scalar-valued spectral measure j and their multiplicity 
functions are equal a.e. [p]. 


There is some notation that is used by many and we should mention its 
connection with what we havejust finished. Suppose m: C > {00,0,1,2,...} 
is a Borel function and yp is a compactly supported measure on C such that 
u({z: m(z) = 0}) = 0. If z © C let #(z) be a Hilbert space of dimension 
m(z). The direct integral of the spaces #(z), denoted by [#(z) du(z), is 
precisely the space 


L?( pA; %,) ® L2(wld,) ® L?(uld,; 3B) @ °°, 


where A, = {z: m(z) =n} and dim 4% =n. lf ¢: C > (4H) U BC) VU 
B(H#H,)U +--+ such that o(z) € @(4,) when z E A,, : A, > B(H,) isa 
Borel function, and there is a constant M such that ||¢(z)|| < M ae. [p], 
then {$(z) du(z) denotes the operator My, © My4, ® --- as in (10.20). 
Although the direct integral notation is quite suggestive, one must revert to 
the notation of (10.20) to produce proofs. 


REMARKS. There are several sources for multiplicity theory. Most begin by 
proving Theorem 10.16. This is done for nonseparable spaces in Halmos 
[1951] and Brown [1974]. Another source is Arveson [1976], where the 
theory is set in the context of C*-algebras which is its proper milieu. Also, 
Arveson shows that the theory can be applied to some non-normal oper- 
ators. The details of this more general multiplicity theory are carried out in 
Ernest [1976] as part of a more general classification scheme. Another 
source for multiplicity theory is Dunford and Schwartz [1963]. 

By Theorem 4.6, every normal operator is unitarily equivalent to a 
multiplication operator M, on L?( X, Q, ) for some measure space (X, Q, 1). 
The scalar-valued spectral measure for M, is wo ~ ', What is the multiplic- 
ity function for M,? One is tempted to say that m uz) = the number of 
points in @ ‘'(z). This is not quite correct. The answer can be found in 
Abrahamse and Kriete [1973]. Also, Abrahamse [1978] contains a survey of 
spectral multiplicity for normal operators treated from this point of view. 


EXERCISES 


1. Let A and B be operators on # and %, respectively. Let #% and %) be 
reducing subspaces for A and B and suppose that A = B|% and B= A|X. 
Show that A = B. 


2. Let ,,M>5,... be compactly supported measures on C such that p,,,; « p,, for 
all n. Show that if M is any normal operator whose spectral measure is 
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13. 


14. 


In 
= 
@ 


absolutely continuous with respect to each p,, then N, ON, @--- = 
N,,®-:-)@M. 


. If » = Lebesgue measure on [0,1], show that N, = N? for 0 < p < oo. 


. Let » = Lebesgue measure on [0,1] and characterize the functions @ in L”(p) 


such that N, = $(N,). 


. Let » = area measure on D and show that N, and N? are not unitarily 


equivalent. 


. Let » = Lebesgue measure on [0,1] and let » = Lebesgue measure on [—1, 1]. 


Show that N, = N, @ N,. How about N,”? 


. Let » be Lebesgue measure on R and N = multiplication by sinx on L7(,). 


Find the decompositions of N obtained in Theorems 10.1 and 10.16. 


. If » is Lebesgue measure on R and N = multiplication by e’* on L*(p), show 


that N = N‘*) where m = arc length measure on dD. 


. Define U: L?7(R) > L*(R) by (Uf)(t) = f(t — 1). Show that U is unitary and 


find its scalar-valued spectral measure and multiplicity function. 


. Represent N as in Theorem 10.1 and find the corresponding representation for 


N ®N=N”™: for N®), for N‘~). (Are you surprised by the result for N‘%??) 


. Prove the results and solve the exercises from § II.8. 


. Let N be a normal operator and show that N = N® if and only if there is a 


*-cyclic normal operator M such that N = M‘*), What does this say about the 
multiplicity function for N? 


Let (X, 2,1) be a measure space such that L?(p) is separable, let ¢ © L©(,), 
and let N = M, on L’(). Find the decompositions of N obtained in Theorems 
10.1 and 10.16. 


Let » be a compactly supported measure on C, ¢ a bounded Borel function on 
C, and suppose {A,,} are pairwise disjoint Borel sets such that $ is one-to-one 
on each A, and p(C \U%_,4,)=0. Let ¢,= x, and pw, =ped, | for 


n> 1. Prove that M, on L*(,) is unitarily equivalent to ®™ ,N, . 


CHAPTER X 


Unbounded Operators 


It is unfortunate for the world we live in that all of the operators that arise 
naturally are not bounded. But that is indeed the case. Thus it is important 
to study such operators. 

The idea here is not to study an arbitrary linear transformation on a 
Hilbert space. In fact, such a study is the province of linear algebra rather 
than analysis. The operators that are to be studied do possess certain 
properties that connect them to the underlying Hilbert space. The properties 
that will be isolated are inspired by natural examples. 

All Hilbert spaces in this chapter are assumed separable. 


§1. Basic Properties and Examples 


The first relaxation in the concept of operator is not to assume that the 
operators are defined everywhere on the Hilbert space. 


1.1. Definition. If 37, #” are Hilbert spaces, a linear operator A: H-> KX 
is a function whose domain of definition is a linear manifold, dom 4, in # 
and such that A(af+ Bg) = aAf+ BAg for f,g in dom A and a, in C. 

A is bounded if there is a constant c > 0 such that ||A/f|| < c||/f|| for all f in 

dom A. 


Note that if A is bounded, then A can be extended to a bounded linear 
operator on cl[dom A] and then extended to # by letting A be 0 on 
(dom A)~ . So unless it is specified to the contrary, a bounded operator will 
always be assumed to be defined on all of #%. 
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If A is a linear operator from # into #, then A 1s also a linear operator 
from cl[dom A] into #. So we will often only consider those A such that 
dom A is dense in #%; such an operator A is said to be densely defined. 
B(H#) still denotes the bounded operators defined on 7%. 

If A, B are linear operators from # into #, then A + B is defined with 
dom( 4A + B)=dom AN domB. If B: #- & and A: # > £, then AB 
is a linear operator from # into Y with dom(AB) = B™'(dom A). 


1.2. Definition. [If A,B are operators from #% into #, then A is an 
extension of B if dom B C dom A and Ah = Bh whenever h € dom B. In 
symbols this is denoted by B C A. 


Note that if A © @(#), then the only extension of A is itself. So this 
concept is only of value for unbounded operators. 
If A: #— X, the graph of A is the set 


graA={h@®@AhE HOH: hE dom}. 
It is easy to see that B C A if and only if gra B C gra A. 


1.3. Definition. An operator A: #—> & is closed if its graph is closed in 
H® XH. An operator is closable if it has a closed extension. Let @(#, # ) 
= the collection of all closed densely defined operators from # into %. 
Let @(4) = €©(#, #). (It should be emphasized that the operators in 
6 (4, # ) are densely defined.) 

When is a subset of #® & a graph of an operator from # into #2? If 
G = gra A for some A: #—> #, then Y is a submanifold of #® XH such 
thatifk @€ 4 andO@keE FY, then k = 0. The converse is also true. That 
is, Suppose that Y is a submanifold of #@® # such that if k © # and 
O@®keEY, then k=0. Let D={heE#: there exists a k in # with 
h@®k in Y¥}. If hE @ and k,,k,&=H such that h@k,, hok,€F, 
then 0 @(k, —k,)=h@®k,-h@®k,€ Y. Hence k, =k,. That is, for 
every A in & there is a unique k in & such that hh @k © Y; denote k by 
k = Ah. It 1s easy to check that A is a linear map and ¥ = gra A. This gives 
an internal characterization of graphs that will be useful in the next 
proposition. 


1.4. Proposition. An operator A: #°— X is closable if and only if cl{gra A] 
is a graph. 


PROOF. Let cl[gra A] be a graph. That is, there is an operator B: #-> 
such that gra B = cl[gra A]. Clearly gra A C gra B, so A is closable. 

Now assume that A is closable; that is, there is a closed operator B: 
KH > HK with AC B.If0 6k € cl{[gra A], 0 ® k € gra B and hence k = 0. 
By the remarks preceding this proposition, cl[gra A] is a graph. = 


If A is closable, call the operator whose graph is cl[gra A] the closure 
of A. 
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1.5. Definition. If 4: #’— & 1s densely defined, let 
dom A* = {k € #: h > (Ah,k) is a bounded linear 
functional on dom 4}. 


Because dom A is dense in #, if k € dom A*, then there is a unique vector 
f in # such that (Ah, k) = ¢h, f) for all h in dom A. Denote this unique 
vector f by f = A*k. Thus 


(Ah, k) = (h, A*k) 


for h in dom A and k in dom A*. 


1.6. Proposition. [f A: #— & is a densely defined operator, then: 


(a) A* is a closed operator; 
(b) A* is densely defined if and only if A is closable; 
(c) if A is closable, then its closure is (A*)* = A**. 


Before proving this, a lemma is needed which will also be useful later. 
1.7. Lemma. Jf A: #-> X is densely defined and J: H® H > H® H is 
defined by J(h ® k) = (—k) @A, then J is an isomorphism and 

gra A* = [Jgra A] 


PROOF. It is clear that J is an isomorphism. To prove the formula for 
gra A*, note that gra A* = {k ® A*kK EC HOEH: k © domA*}. Soifk € 
dom A* and h € dom 4A, 


(k ® A*k, J(h ® Ah)) 


(k ® A*k, —Ah ® h) 
—(k, Ah) + (A*k, h) = 0. 


Thus gra A* C[Jgra A]*. Conversely, if k @f €[JgraA]~, then for 
every A in domdA, 0=¢k ®f,—-Ah ®h) = —¢k, Ah) + Cf, h), so 
(Ah, k) = ch, f). By definition k € dom A* and A*k = f. a 


PROOF OF PROPOSITION 1.6. The proof of (a) is clear from Lemma 1.7. For 
the remainder of the proof notice that because the map J in (1.7) is an 
isomorphism, J* = J~' and so J/*(k ®@h) =h @ (—k). 

(b) Assume A is closable and let k, € (dom A*)~. We want to show that 
k, = 0. Thus k, © 0 € [gra A*]* = [J gra A] ** = cl[J gra A] = 
J[cl(gra A)]. So 0 @ —ky =J*(k, ® 0) © J*J[cl(gra A)] = cl(gra A). But 
because A is closable, cl(gra A) is a graph; hence k, = 0. For the converse, 
assume dom A* is dense in #. Thus A** = (A*)* is defined. By (a), A** 
is a closed operator. It 1s easy to see that A C A**, so A has a closed 
extension. 

(c) Note that by Lemma 1.7 gra A** = [J *gra A*]*+ = [J*[J gra A]*]~. 
But for any linear manifold # and any isomorphism J, (JW@)* = J(.@~*). 
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Hence J*[(.4@)*]=.@~ and, thus, [J*[.@]*]*>=.@**=cl. Put- 
ting 4 = graA gives that gra A** = clegra A. a 


1.8. Corollary. Jf A © @(4#, &), then A* is densely defined and A** = A. 


1.9. Example. Let e,,e,,... be an orthonormal basis for # and let 
My, a, ... be complex numbers. Define 9 = {h € #: LF la,(h,e,)|° < 00} 
and let Ah = Lifa,ch,e, ye, forhin 9. Then A € (#) withdom A = J. 
Also, dom A* = 9 and A*h = Loa,ch,e,,e, for all h in J. 


1.10. Example. Let (X, 02, 1) be a o-finite measure space and let ¢: X — C 
be an (2-measurable function. Let D={f€ L?(p): of © L*(w)} and 
define Af = of for all f in 9. Then A € G(L7(n)), dom A* = J, and 
A*f = of for f in Z. 


1.11. Example. Let 2 = all functions f: [0,1] C that are absolutely 
continuous with f’ € L*(0,1) and such that f(0) = f(1) = 0. @ includes all 
polynomials p with p(0) = p(1)=0. So the uniform closure of JF is 
{ f © C[0,1]: f(0) = f(1) = 0}. Thus @ is dense in L7(0,1). Define A: 
L?(0,1) > L?(0,1) by Af = if’ for f in Z. To see that A is closed, suppose 
{f,} GQ and f, © if/ >f @g in L? ® L’. Let h(x) = —iffg(t) dt; so h 
is absolutely continuous. Now using the Cauchy-Schwarz inequality we get 
that [f,(x) — h(x)] = elf) + ig(t)) dt] < If! + iglls = Wit’ — alle. 
Thus f,(x) — h(x) uniformly on [0,1]. Since f, > f in L7(0,1), f(x) = 
h(x) a.e. So we may assume that f(x) = —ifo‘g(t) dt for all x. Therefore f 
is absolutely continuous and f(x) > f(x) uniformly on [0,1]; thus f(0) = 
f(Q) =0 and f’ = —ig € L*(0,1). So fE€ D and f@ g=f 6 if’ € gra A; 
that is, A € @(L7(0,1)). 
Note that {f’: f€ 9} = {hE L’(0,1): foh(x) dx = 0} = [I] +. 


Claim. dom A* = {g: g is absolutely continuous on [0,1], g’ € L7(0,1)} 
and for g in dom A*, A*g = ig’. 


In fact, suppose g € dom A* and let h = A*g. Put A(x) = {Sh(t) dt. 
Using integration by parts, for every f in J, ify f’g = (Af, g) = (f,h) = 
Io fh = fo f(x) dH(x) = —fo f(x)H(x) dx; that is, (f’, -ig) = 
(f’, -H) for all f in 9. Thus H-ige {(f’: fe D}+=[1]* 7; hence 
H — ig = c, a constant function. Thus g = ic — iH so that g is absolutely 
continuous and g’ = —ih © L’. Also note that A*g = h = ig’. The other 
inclusion 1s left to the reader. 


1.12. Example. Let & = { f € L7(0,1): f is absolutely continuous, f’ € L?, 
and f(0) = f(1)}. Define Bf = if’ for f in &. As in (1.11), B € (L7(0,1)) 
and ran B = [1]-+. 


Claim. dom B* = & and B*g = ig’ for g in @. 
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Let g@ dom B*. Put h= B*g and A(x) = [ch(t) dt. As in (1.11), 
H(0) = H(1) = 0 and for every f in @, ifjf’g = — fj fH. Hence 0 = 
(dCif’'z + f/H) = feif’(g + iH). Thus g + iH 1 ran B and so g + iH =, 
a constant function. Thus g = c — iH is absolutely continuous, g’ = —ih 
EL’, and g(0) = g9(1)=c. Thus g€& and B*g =h = ig’. The other 
inclusion is left to the reader. 

The preceding two examples illustrate the fact that the calculation of the 
adjoint depends on the domain of the operator, not just the formal defini- 
tion of the operator. Note the fact that the next result generalizes (11.2.19). 


1.13. Proposition. If A: #7 & is densely defined, then 


(ran A)” = ker A*. 
If A is also closed, then 

(ran A*)~ = ker A. 
Proor. If h 1 ran A, then for every f in dom A, 0 = (Af,h). Hence 
h € dom A* and A*h = 0. The other inclusion is clear. By Corollary 1.8, if 


A © ©(#, X), A** =A. So the second equality follows from the first. 
a 


1.14. Definition. If A: #—- & is a linear operator, A is boundedly 
invertible if there is a bounded linear operator B: #— # such that 
AB =1 and BA Cl. 


Note that if BA C1, then BA is bounded on its domain. Call B the 
(bounded ) inverse of A. 


1.15. Proposition. Let A: #’— & be a linear operator. 


(a) A is boundedly invertible if and only if ker A = (0), ran A = X#, and the 
graph of A is closed. 
(b) If A is boundedly invertible, its inverse is unique. 


PROOF. (a) Let B be the bounded inverse of A. So dom B = &. Since 
BA C1, kerA = (0); since AB = 1, ran A = &. Also, gra A = {h @ Ah: 
h <= dom A} = {Bk Ok: k © XH}. Since B is bounded, gra A is closed. 
Conversely, if A has the stated properties, Bk = A-'k for k in # is a 
well-defined operator on %. Because gra A 1s closed, gra B is closed. By the 
Closed Graph Theorem, B € &(X#, #). 

(b) This is an exercise. | 


1.16. Definition. If 4: #0—- # 1s a linear operator, p( A), the resolvent set 
for A, is defined by p(A) = {A € C: A — A is boundedly invertible}. The 
spectrum of A is the set o(A) = C \ p(A): 


It is easy to see that if A: #— # is a linear operator and A € C, gra A 
is closed if and only if gra(A — A) is closed. So if A does not have closed 
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graph, o(A) =O. Even if A has closed graph, it is possible that o( A) is 
empty (see Exercise 10). However, some of the other properties of the 
spectrum hold. The proof of the next result is left to the reader. 


1.17. Proposition. Jf A: #- # is a linear operator, then o(A) is closed 
and z > (z— A)7' is an analytic function on p(A). 


Note that if A is defined as in Example 1.9, then o( A) = cl{a,,}. Hence it 
is possible for o( A) to equal any closed subset of C. 


1.18. Proposition. Let A € ¢(#”). 


(a) A € p(A) if and only if ker(A — A) = (0) and ran(A — A) = #. 
(b) o(A*) = {X: XE o(A)} and for d in p(A), (A —-A)* 1 =[(A - 
mela: 


PROOF. Exercise. 


EXERCISES 
. If A, B, and AB are densely defined linear operators, show that (AB)* D B*A*. 
. Verify the statements in Example 1.9. 
. Verify the statements in Example 1.10. 


1 
ps 
3 
4. Define an unbounded weighted shift and determine its adjoint. 
5. Verify the statements in Example 1.11. 

6 


. If # is infinite dimensional, show that there is a linear operator A: #—- # 
such that gra A is dense in #@ #. What does this say about dom A*? 


7. Let @ be the set of absolutely continuous functions f such that f’ € L?(0,1). 
Let Df =f’ for f in 9 and let (Af)(x) = xf(x) for f in L7(0,1). Show that 
DA — AD G1. 


8. If x is a Banach algebra with identity, show that there are no elements a, b in 
S& such that ab — ba = 1. (Hint: compute a”b — ba”.) 


9. Prove Proposition 1.18. 


10. Define A: L*(IR) > L?(R) by (Af)(x) = exp(—x?)f(x — 1) for all f in L7(R). 
(a) Show that A € @(L?(R)). (b) Find ||A”|| and show that r(A) = 0 so that 
o(A) = {0}. (c) Show that A is injective. (d) Find A* and show that ran A is 
dense. (e) Define B = A4~! with dom B = ran A and show that B € 4(L7(R)) 
with o(B) = 0. 


ll. If A € @(#), show that A*A € Y(#). Show that —1 € o(A*A) and that if 
B= (1 + A*A)"|, ||Bl| < 1. 


12. If B is the bounded operator obtained in Exercise 11, show that C = AB is also 
bounded and ||C|| < 1. 
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§2. Symmetric and Self-Adjoint Operators 


A correct introduction to this section consists in a careful examination of 
Examples 1.11 and 1.12 in the preceding section. In (1.11) we saw that the 
operator A seemed to be inclined to be self-adjoint, but dom A* was 
different from dom A so we could not truly say that A = A*. In (1.12), 
B = B* in any sense of the concept of equality. This points out the 
distinction between symmetric and self-adjoint operators that it is necessary 
to make in the theory of unbounded operators. 


2.1. Definition. An operator A: #— # is symmetric if A 1s densely 
defined and ¢ Af, g) = ¢f, Ag) for all f, g in dom A. 


The proof of the next proposition is left to the reader. 


2.2. Proposition. Jf A is densely defined, the following statements are 
equivalent. 


(a) A is symmetric. 
(b) (Af, f) © R for all fin dom A. 
(c) A C A®. 


If A is symmetric, then the fact that A C A* implies dom A%* 1s dense. 
Hence A is closable by Proposition 1.6. 

It is easy to check that the operators in Examples 1.11 and 1.12 are 
symmetric. 


2.3. Definition. A densely defined operator A: “> # 1s self-adjoint if 
A = A*. 


Let us emphasize that the condition that A = A* in the preceding 
definition carries with it the requirement that dom A = dom A*. Now 
clearly every self-adjoint operator is symmetric, but the operator A in 
Example 1.11 shows that there are symmetric operators that are not 
self-adjoint. If, however, an operator is bounded, then it is self-adjoint if 
and only if it is symmetric. The operator B in Example 1.12 is an 
unbounded self-adjoint operator and Examples 1.9 and 1.10 can be used to 
furnish additional examples of unbounded self-adjoint operators. 

Note that Proposition 1.6 implies that a self-adjoint operator is neces- 
sarily closed. 


2.4. Proposition. Suppose A is a symmetric operator on #. 


(a) If ran A is dense, then A is injective. 
(b) If A = A* and A is injective, then ran A is dense and A~' is self-adjoint. 
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(c) If dom A = #, then A = A®* and A is bounded. 
(d) If ran A = #, then A = A* and A ' € BH). 


PRooF. The proof of (a) is trivial and (b) is an easy consequence of (1.13) 
and some manipulation. 

(c) We have A C A*. If dom A = #, then A = A* and so A is closed. By 
the Closed Graph Theorem 4A € &(#). 

(d) If ran A = #, then A is injective by (a). Let B = A~! with dom B = 
ran A = #. If f= Ag and h= Ak, with g,k in dom A, then (Bf,h) = 
(g, Ak) = (Ag, k) = (f,k) = (f, Bh). Hence B is symmetric. By (c), 
B = B* &€ B(H#). By (b), A = B™! is self-adjoint. 


We now will turn our attention to the spectral properties of symmetric 
and self-adjoint operators. In particular, it will be seen that symmetric 
operators can have nonreal numbers in their spectrum, though the nature of 
the spectrum can be completely diagnosed (2.8). Self-adjoint operators, 
however, must have real spectra. The next result begins this spectral 
discussion. 


2.5. Proposition. Let A be a symmetric operator and let \ = a + iB, a and 
B real numbers. 


(a) For each f in dom A, ||(A — A)fll? = |\(A — a) fll? + B'S. 
(b) If B # 0, ker(A — A) = (0). 
(c) If A is closed and B # 0, ran(A — A) is closed. 


PROOF. Note that 
(A — A) AI? = ICA — a) f — BAI? 


= ||(A — a) ff + 2Rei((A — a) f, Bf) + B'S. 
But 


((A — a) f, Bf) = BAP, f) — oB\|f\I? € R, 
so (a) follows. Part (b) is immediate from (a). To prove (c), note that 
(A — A)? = B7IAI°. Let (f,} C dom A such that (A — A)f, > g. The 
preceding inequality implies that {f,} is a Cauchy sequence in #; let 
f=lmf,. But f, ® (A —A)f, © gra(A —A)and f, @(A-A)f. of @g. 
Hence f®ge&gra(A —A) and so g=(A-—A)f€ran(A —A). This 
proves (c). - 


2.6. Lemma. Jf 7%, W are closed subspaces of # and M1 N * = (0), then 
dim @ < dimy. 


PROOF. Let P be the orthogonal projection of # onto W and define T: 

M—-> N by Tf = Pf for f in 4. Since 41 VW + = (0), T is injective. If & 
is a finite-dimensional subspace of .4, dim = dimTL< dim. . Since FY 
was arbitrary, dim.4 < dim. a 
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2.7. Theorem. /f A is a closed symmetric operator, then dim ker( A* — 2d) is 
constant for ImX > 0 and constant for ImA < 0. 


PROOF. Let A = a + iB, a and B real numbers and B # 0. 
Claim. If |A — p| < |B], ker(A* — pw) M [ker(A* — A)] > = (0). 


PURPOSE this is not so. Then there is an f in ker{ A* — p) A [ker( A* — 
X)} + with |[f|| = 1. By (2.5c), ran(A — A) is closed. Hence f € [ker(A* — 
\)]+ = ran(A — A). Let g © domA such that f=(A —AX)g. Since fe 
ker(A* — p), 


O0= ((A*—p)f,g) =(f,(4 - Bg) 
=(f,(A-A+A—jt)g) 
= [fll +(A — wf, 8). 


Hence 1 = |IfJ° = [A — eI 1(f,8)1 < IA — el lIgil. But (2.5a) implies that 
1 = |Ifll = (A —A)all = IAI Iigils so Ilgil < |B] 1. Hence 1 < |A = pI IIgl 
< |A — pl |B|~' <1 if JA — pl < |B. This sonteadicuion establishes. the 
claim. 

Combining the claim with Lemma 2.6 gives that dimker( A* — p) < 
dim ker(A* — A) if |A — p| < |B] = |ImA]. Note that if |A — p| < 35/8], 
then |A — p| < [Img], so that the other inequality also holds. This shows 
that the function A — dimker(A* — A) is locally constant on C\R. A 
simple topological argument demonstrates the theorem. ia 


2.8. Theorem. Jf A is a closed symmetric operator, then one and only one of 
the following possibilities occurs: 


(a) o(A)=C 
(b) o0(A) = {A EC: ImA = 0}; 
(c) o0(A) = {A EC: ImA < 0}; 
(d) o(A) CR. 


Proor. Let H,= {A © C: +ImA > 0}. By (2.5) for A in H,, A —-A is 
injective and ee closed range. So if A — A is surjective, A < o( A) (2.4d). 
But [ran(A — X)]+ = ker(A* — A). So the preceding theorem implies that 
either H ,C o0(A) or HA o(A) =. Since o( A) 1s closed, if H,€ o( A), 
then eile: 0(A)=C€ or o(A)= clH,. If HM 0(A)=0, 3(A) CR. 

| 


2.9. Corollary. [f A is a closed symmetric operator, the following statements 
are equivalent. 


(a) A is self-adjoint. 
(b) o( A) CR. 
(c) ker(A* — i) = ker(A* + 7) = 0. 
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PROOF. If A is symmetric, every eigenvalue of A is real (Exercise 1). So if 
A = A* and ImA # 0, ker(A* — A) = ker(A — A) = (0). By Theorem 2.8, 
o(A) C R, so (a) implies (b). 

If o(A) CR, ker(A* + i) = [ran(A + i)]}+ = #* = (0). Hence (b) im- 
plies (c). 

If (c) holds, then this, combined with (2.5) and (1.15a), implies A + 7 is 
boundedly invertible. By (1.18), A* +7 is boundedly invertible. Let h € 
dom A*. Then there is an f in dom A such that (A + i)f = (A* + i)h. But 
A* +iDdA+i, so (A* + i)f =(A* + i)h. But A* +i is injective, so 
h = f © dom A. Thus A = A*. | 


2.10. Corollary. If A is a closed symmetric operator and o( A) does not 
contain R, then A = A*. 


It may have occurred to the reader that a symmetric operator A fails to be 
self-adjoint because its domain is too small and that this can be rectified by 
merely increasing the size of the domain. Indeed, if A is the symmetric 
operator in Example 1.11, then the operator B of Example 1.12 is a 
self-adjoint extension of A. However, the general situation is not always so 
cooperative. 

Fix a symmetric operator A and suppose B is a symmetric extension of 
A: ACB. It is easy to verify that B* C A*. Since BC B*, we get 
A CBC B* C A®*., Thus every symmetric extension of A is a restriction of 
A*, 


2.11. Proposition. (a) A symmetric operator has a maximal symmetric 
extension. (b) Maximal symmetric extensions are closed. (c) A self-adjoint 
operator is a maximal symmetric operator. 


PROOF. Part (a) is an easy application of Zorn’s Lemma. If A is symmetric, 
A € A* and so A is closable. The closure of a symmetric operator is 
symmetric (Exercise 3), so part (b) is immediate. Part (c) is a consequence of 
the comments preceding this proposition. | 


2.12. Definition. Let A be a closed symmetric operator. The deficiency 
subspaces of A are the spaces 


L, = ker(A* — i) = [ran(A + i)]~, 
L = ker(A* +i) = [ran(A —i)]~. 
The deficiency indices of A are the numbers n ,= dim # ,. 


It is possible for any pair of deficiency indices to occur (see Exercise 6). 
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In order to study the closed symmetric extensions of a symmetric oper- 
ator we also introduce the spaces 


A= (J Oa Tee. 
H={g@ig: gEeL}. 


So 4 .< #® #. Notice that #, are contained in gra A* and are the 
portions of the graph of A* that lie above &,. The next lemma will 
indicate why the deficiency subspaces are so named. 


2.13. Lemma. If A is a closed symmetric operator, 
gra A* = grad O40 XK... 

Proor. Let fE ¥, and h € dom A. Then 

(h ® Ah, f ® if) = ¢h, f) — i¢Ah, f) 
—i((A + i)h, f) 

= 0 

since Y, = [ran(A + i)]~. The remainder of the proof that gra A, %,, and 
XH are pairwise orthogonal is left to the reader. Since it is clear that 
gra A ® XH, ® XH _C gra A*, it remains to show that this direct sum is dense 
in gra A*., 

Let h € dom A* and assume h © A*h 1 grad © #8 &.. Since h ® 
A*h 1 gra A, for every f in dom A, 0 = (h ® A*h, f ® Af) = (h, f) + 
(A*h, Af). So (A*h, Af) = —¢h, f) for every f in dom A. This implies 
that A*h © dom A* and A*Ad*h = —h. Therefore (A* — i)(A* + i)h = 
(A*A* + 1)h =0. Thus (A* +i)h © Y,. Reversing the order of these 
factors also shows that (A* — i)h EY. Butif gE 4,0 = (h © A*h, 2 ® 
ig) = (h,g) — i(A*h, g) = —i((A* + i)h, g). Since g can be taken equal 
to (A* + i)h, we get that (A* + i)h = 0, orh © #.. Similarly, h © &,. So 
hEL,ALFf_= (0). | 


2.14. Definition. If A is a closed symmetric operator and .@ is a linear 
manifold in dom A*, then Z is A-symmetric if (A*f, g) = (f, A*g) for all 
f, g in . Call such a manifold A-closed if { f ® A*f: f © M} is closed in 
HOH. 


So .@ is both A-symmetric and A-closed precisely when A*|.Z, the 
restriction of A* to.Z@, is a closed symmetric operator; if 47 D dom A, then 
A*|@ is a closed symmetric extension of A. 


2.15. Lemma. If A is a closed symmetric operator on # and B is a closed 
symmetric extension of A, then there is an A-closed, A-symmetric submanifold 
M of £.+ L_ such that 


2.16 gra B = gra A + gra( A*|.Z). 
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Conversely, if M is an A-closed, A-symmetric manifold in £,+ £_, then 
there is a closed symmetric extension B of A such that (2.16) holds. 


ProoF. If the A-closed, A-symmetric manifold 4 in #,+ £_ is given, let 
Q=domA +4. Since ZC dom A*, B = A*|@ is well defined. Let f= fp 
+ fi, = Bo + 21 fos Bo in dom A and f,, g, in 4. Then 


(A*f, £) = (A*fo + A*fi, Bo + 81) 


= (Afo, 89) + (Afo, 81) + CA", 80) + CATAL, B1)- 


Using the A-symmetry of .4, the symmetry of A, and the definition of A* 
we get 


(A*f, 8) = (fo, ABo) + (for A*B1) + (fi, AB0) + (fi, A*81) 


=KfA'e): 


So B= A*|QM is symmetric. Note that gra A 1 gra(A*|.Z) in #0 #. 
Since both of these spaces are closed, gra B, given by (2.16), is closed. 

Now let B be any closed symmetric extension of A. As discussed before, 
ACBCA*; so graA C graBC gradA* = gradAO#,6#.. Let G= 
gra BO (#8 #_) and let 4 = the set of first coordinates of elements in 
G. Clearly, @ is a manifoldin #,+Y and &C dom B. Hence for f, g 
in 4, (A*f, 2) = (Bf, g) = ¢f, Bg) = (f, A*g). So @ is A-symmetric. 
Clearly, gra( A*|.ZH) = GY, so @ is A-closed. If h ® Bh € gra B, let h © Bh 
=(f@Af)+k when fEdomA and KE #,0”#. Since ACB, kE 
gra B; sok © Y. This shows that (2.16) holds. Cy 


2.17. Theorem. Let A be a closed symmetric operator. If W is a partial 
isometry with initial space in &, and final subspace in £_, let 


2.18 Gy={ft+et We: fe dom 4, g € initial W } 
and define Ay, on Dy, by 
2.19 Ay(f+2e+ We) = Aft ig — iWe. 


Then A,, is a closed symmetric extension of A. Conversely, if B is any closed 
symmetric extension of A, then there is a unique partial isometry W such that 
B= Ay as in (2.19). 

If W is such a partial isometry and W has finite rank, then 


n (Ay) =n,(A) — dim(ranW). 


PROOF. Let W be a partial isometry with initial space 7, in Y, and final 
space J_ in &.. Define 9, and A,, as in (2.18) and (2.19). Let Z= {g + 
We: gel,}; so # is a manifold in ¥+%. If g,hEJ,, then 
(Wg, Wh) = (g,h). Hence (A*(g + Wg), h + Wh) = (A*g, h) + 
(A*g,Wh) + (A*Wg,h) + (A*Wg, Wh). Since g € ker(A* — i) and Wg 
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© ker(A* + 1), 
(A*(g + Wg),h+ Wh) = i(g,h) + i(g,Wh) — i(Wg,h) — i(We, Wh) 
= 1(g,Wh) — i(Wg,h). 
Similarly, (g + Wg, A*(h + Wh)) = i(g,Wh) — i(Wg,h), so that 7 is 
A-symmetric. If {g,} C/, and (g, + Wg,) ® (ig, — iWg,) > f @®h in 
HO H, then 2ig, = i(g, + Weg,) + (ig, — iWg,) > if+h and 2iWg, = 
i(g, + We,) — (ig, — iWe,) > if — h. If g = (2i) if + A), then f = g + 
Weg and h = ig — iWg. Hence @ is A-closed. By Lemma 2.15, Ay, is a 
closed symmetric extension of A. 
To prove that n,(A,) =n,(A) — dimT/,, let f= dom A, g € J,. Then 
(A, +i)(f+e+ We) =(A+t+i)ft ig —iWe + ig+ iWe 
=(A+i)f+ 2ig. 
Thus ran(A, + i) =ran(A +i) @®J,, and so n,(A,,) = dim[ran(A,, + 
i)}' = dim #6 I,=n,(A)-— dimZJ,. Similarly, n_(A,y) = n_(A) - 
dim J_=n_(A)-—dimT/,,. 

Now let B be a closed symmetric extension of A. By Lemma 2.15 there is 
an A-symmetric, A-closed manifold 4 in ¥+#_ such that gra B= 
gra A + gra(A*|.@). If fe 4, let f=f'+f, where ft*eL#,; put J,= 
{ft: fem}. Since @ is A-symmetric, 0 = (A*f, f) — (f, A*f) = 
2i¢ ft, f?) — 2i¢f, f-); hence ||f* || = |[f- || for all f in 7%. Soif WfT= f- 
whenever f = f'+ f-€.@ and if J, is closed, W is a partial isometry and 
(2.18) and (2.19) are easily seen to hold. It remains to show that J, is 
closed. Suppose { f,} CW and f* > g' in &,. Since ||f7 — fail = II, — 
f,||, there is a g” in Y such that f/ > g. Clearly f, > g*+ 2 =. 
Also, A*f,* = +if,* ~ tig*. It follows that g ®@ A*g € clgra(A*|.Z) = 
gra(A*|.Z); thus g~e I”. zi 


2.20. Theorem. Let A be a closed symmetric operator with deficiency indices 

n ,. 

(a) A is self-adjoint if and only ifn,= n_= 0. 

(b) A has a self-adjoint extension if and only if n= n_. In this case the set of 
self-adjoint extensions is in natural correspondence with the set of isomor- 
phisms of £, onto £_. 

(c) A is a maximal symmetric operator that is not self-adjoint if and only if 
eithern,= 0 andn_>0Oorn,>0 andn_= 0. 


PRooF. Part (a) is a rephrasing of Corollary 2.9. For (b), n,=n_ if and 
only if #, and Y& are isomorphic. But this is equivalent to stating that 
there is a partial isometry on # with initial and final spaces #, and #., 
respectively. Part (c) follows easily from the preceding theorem. B 


2.21. Example. Let A and @ be as in Example 1.11; so A is symmetric. 
The operator B of Example 1.12 is a self-adjoint extension of A. Let us 
determine all self-adjoint extensions of A. To do this it is necessary to 
determine #,. Now fe, if and only if f © dom A* and +if = A*f = 
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if’, so 2, = {ae**: a © C}. Hence n ,= 1. Also, the isomorphisms of Y, 
onto #_ are all of the form W,e* = Ae ~*~ where |A| = 1. If |A| = 1, let 


D, = {ft+ae*+rae*:a€C€, fe}, 
A,( f+ ae* + \ae~*) = if’ + aie* — idkae*, 
ffEeDBa€eC. 


According to Theorem 2.17, {(A,, Z,): |A| = 1} are all of the self-adjoint 
extensions of A. The operator B of Example 1.12 is the extension A,. 

For more information on symmetric operators and the relation of the 
problem of finding self-adjoint extensions to physical problems, see Reed 
and Simon [1975] from which much of the present development is taken. 


EXERCISES 
1. If A is symmetric, show that all of the eigenvalues of A are real. 


2. If A is symmetric and A,p are distinct eigenvalues, show that ker(A — A) 1 
ker(A — p). 


3. Show that the closure of a symmetric operator is symmetric. 


4. Let D= { f ] L?(0,00): for every c > 0, f is absolutely continuous on [0, c], 
f(0) = 0, and f’ € L?(0,00)}. Define Af = if’ for f in 2. Show that A is a 
densely defined closed operator and find dom A*. Show that A is symmetric with 
deficiency indices n,= 0 and n_= 1. 


5. Let &= { f € L*(— 00,0): for every c < 0, f is absolutely continuous on [c, 0], 
f(0) = 0, and f’ € L*(—,0)}. Define Af = if’ for f in &. Show that A is a 
densely defined closed operator and find dom A*. Show that A is symmetric with 
deficiency indices n,= 1, n_= 0. 


6. If k,/ are any nonnegative integers or oo, show that there is a closed symmetric 
operator A with n,= k and n_=/. (Hint: Use Exercises 4 and 5.) 


7. Let C?(0,1) be all twice continuously differentiable functions on (0, 1) with 
compact support and let Af = —f” for f in C?(0,1). Show that the closure of A 
is a densely defined symmetric operator and determine all of its self-adjoint 
extensions. 


8. If A € @(#), show that A*A is self-adjoint (see Exercise 1.11). 


§3. The Cayley Transform 


Consider the Mobius transformation 

2 

zt+i- 

It is immediate that M(0) = —1, M(1) = —i, and M(coo)=1. Thus M 


M(z)= 
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maps the upper half plane onto D and M(R U oo)= dD. So if A is 
self-adjoint, M(A) should be unitary. Suppose A is symmetric; does M(A) 
make sense? What is M( A)? 

To answer these questions, we should first investigate the meaning of 
M(A) if A is symmetric. We want to define M(A) as (A — i)(A + i) 1. As 
was seen in the last section, however, ran(A + 7) is not necessarily all of #7 
if A is not self-adjoint. In fact, (ran(A + i))* =, and (ran(A — i))*= 
L_, the deficiency spaces for A. However (2.5), if A is closed and symmetric, 
ran(A + i) is closed. Also, realize that if w = M(z), then z = M ‘(w) = 
i(1 + w)/(Q1 — w). 


3.1. Theorem. (a) Jf A is a closed densely defined symmetric operator with 
deficiency subspaces & ,, and if U: #’— # is defined by letting U = 0 on 
L, and 


3.2 U=(A-i)(At+i)— 


on L&,*, then U is a partial isometry with initial space #,* , final space £~, 
and such that (1 — U)\(£,") is dense in #. 

(b) If U is a partial isometry with initial and final spaces M and WN, 
respectively, and such that (1 — U)4 is dense in #H, then 


33 A=i1+U)—-U)° 


is a densely defined closed symmetric operator with deficiency subspaces 
L.=M* and L=N~. 

(c) If A is given as in (a) and U is defined by (3.2), then A and U satisfy 
(3.3). If U is given as in (b) and A is defined by (3.3), then A and U satisfy 
(3.2). 


PROOF. (a) By (2.5c), ran(A + i) is closed and so #/-= ran(A + i). By 
(2.5b), ker(A + i) = (0), so (A + i)~! is well defined on ¥,*. Moreover, 
(A + i) '¥%4cC dom A so that U defined by (3.2) makes sense and gives a 
well-defined operator. If he Y,", then h=(A+i)f for a unique f in 
dom A. Hence ||UAl|? = ||(A — ) fll? = (2.5a) JAP? + IAI? = CA + DAP 
= ||h||*. Hence U is a partial isometry, (kerU)*+= Y,*, and ranU = £". 
Once again, if f& dom A and h=(A+i)f, then (1 — U)h=h—-(A-—- 
i)f =(A + i)f—-—(A —-i)f =2if. So (A —- U) Y= dom A and is dense 
in #. 

(b) Now assume that U is a partial isometry as in (b). It follows that 
ker(1 — U) = (0). In fact, if f € ker(1 — U), then Uf = f; so ||fI| = || U/l 
and hence f € initialU. Since U*U is the projection onto initial U, f= 
U*Uf = U*f; so f © ker(1 — U*) = ran(1 — U)* C[(1 — U)4@)~= (0) 
by hypothesis. Thus f = 0 and 1 — U is injective. 

Let D=(1—U) mM and define (1 — U)' on Q. Because 1 — U is 
bounded, gra(1 — U)7! is closed. If A is defined as in (3.3), it follows that 
A is a closed densely defined operator. If f, g © Y, let f= (1 — U)h and 
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g=(1 —- U)k, h,k © &. Hence 


(Af, g) = i(1 + U)h, (1 — U)k) 
= i[(h,k) + (Uh, k) — (h,Uk) — (Uh, Uk]. 
Since h,k © 4, (Uh, Uk) = ¢h,k); hence (Af, g) = i[(Uh,k) — 
(h, Uk)]. Similarly, (f, Ag) = —i(Q — U)h, + U)k) = —i[(h, Uk) — 
(Uh, k)] = (Af, g). Hence A is symmetric. 

Finally, if h@ 4# and f=(1 — U)hA, then (A + i) f= Af + if= itl + 
U)h + id. — U)h = 2ih. Thus ran(A + 1) =. Similarly, (A — i)f = id 
+ U)h —i(1 — U)h = 2Uh, so that ran(A — 7) = ranU = Y. 

(c) Suppose A is as in (a) and U is defined as in (3.2). If g € (1 — U)Y,", 
put g=(1 — U)h, where h € Y= ran(A + i). Hence h = (A + 1)f for 
some f in dom A. Thus g =h — Uh=(A+i)f—(A — i)f = 2if; so f= 
— tig. Also, 

i1+U)Q—-U) 


“g =i(1 + U)h 
= i[h + Uh] 


=i[(A +i f+(4-A)f] 
= 2iAf 
= Ag. 
Therefore (3.3) holds. 
The proof of the remainder of (c) is left to the reader. a 


3.4. Definition. If A is a densely defined closed symmetric operator, the 
partial isometry U defined by (3.2) is called the Cayley transform of A. 


3.5. Corollary. If A is a self-adjoint operator and U is its Cayley transform, 
then U is a unitary operator with ker(1 — U) = (0). Conversely, if U is a 
unitary with | € o,(U), then the operator A defined by (3.3) is self-adjoint. 


PROOF. If A is a densely defined symmetric operator, then A is self-adjoint 
if and only if & = (0). A partial isometry is a unitary operator if and only 
if its initial and final spaces are all of #. This corollary is now seen to 
follow from Theorem 3.1. a 


One use of the Cayley transform is to study self-adjoint operators by 
using the theory of unitary operators. Indeed, the preceding results say that 
there is a bijective correspondence between self-adjoint operators and the 
set of unitary operators without 1 as an eigenvalue. 


EXERCISES 


1. If U is a partial isometry, show that the following statements are equivalent: (a) 
ker(1 — U) = (0); (b) ker. — U*) = (0); (c) ran(1 — U) is dense; (d) ran(1 — 
U*) is dense. 
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2. Let U be a partial isometry with initial and final spaces “ and WW, respectively. 
Show that the following statements are equivalent: (a) (1 — U).@ is dense; (b) 
(1 — U*)W is dense; (c) ker(U* — U*U) = (0); (d) ker(U — UU*) = (0). 


3. Find a partial isometry U such that ker(1 — U) = (0) but (1 — U)(kerU)~* is 
not dense. 


4. If A is a densely defined closed symmetric operator and B and C are the 
operators defined in Exercises 1.11 and 1.12, then the Cayley transform of A is 
(C — iB\(C + iB)*}. 


5. Find the Cayley transform of the operator in Example 1.9 when each a,, 1s real. 
6. Find the Cayley transform of the operator in Example 1.10 when ¢ is real valued. 


7. Let S be the unilateral shift of multiplicity 1 and find the symmetric operator A 
such that S is the Cayley transform of A. 


8. Let U = S*, where S is the unilateral shift of multiplicity 1. Is U the Cayley 
transform of a symmetric operator A? If so, find it. 


§4. Unbounded Normal Operators and the 
Spectral Theorem 


If A is self-adjoint, the classical way to obtain the spectral decomposition of 
A is to let U be the Cayley transform of A, obtain the spectral decomposi- 
tion of U, and then use the inverse Cayley transform to translate this back 
to a decomposition for A. There is a spectral theorem for unbounded 
normal operators, however, and the Cayley transform is not applicable here. 

In this section the approach is to prove the spectral theorem for normal 
operators by using that theorem for the bounded case. The spectral theorem 
for self-adjoint operators is then only a special case. 


4.1. Definition. A linear operator N on # is normal if N 1s closed, 
densely defined, and N*N = NN*. 


Note that the equation N*N = NN* that appears in Definition 4.1 
implicitly carries the condition that dom N*N = dom NN *. The operators 
in Examples 1.9 and 1.10 are normal and every self-adjoint operator is 
normal. Examining Example 1.9 it is easy to see that for a normal operator 
it is not necessarily the case that dom N*N = dom N. 

Parts of the next result have appeared in various exercises in this chapter, 
but a complete proof is given here. 


4.2. Proposition. Jf A € @(#), then 


(a) 1 + A*A4 has a bounded inverse defined on all of #. 
(b) If B = (1 + A*A)~', then ||B\| < 1 and B = 0. 
(c) The operator C = A(1 + A*A)~? is a contraction. 
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(d) A*A is self-adjoint. 
(e) {h ® Ah: h © dom A*A} is dense in gra A. 


PROOF. Define J: #8 #H- H8 H by J(h ®k)=(—-k) @h. By Lemma 
1.7, gra A* = [Jgra A]*. So if h © #, there are f in domA and g in 
dom A* such thatO @h=JS(f @ Af)+g 0 A*g=(—-Af) Oft+ 2 A*g. 
Hence 0 = —Af+eg, or g=Af; also, h=f+A*g=f+ A*Af=(1 + 
A*A)f. Thus ran(] + A*A) = #. 

Also, for f in dom A*A, Af © dom A®* and ||f + A*Af||? = |I fl]? + 2\AAl? 
+ ||A*Af\|? > |[f\l?. Hence ker(1 + A*A) = (0). Thus (1 + A*A)~! exists, is 
closed, and is defined on all of #7. It must be that (1 + A*4)"! € B(#) 
(1.15). This proves (a). 

It was shown that ||(1 + A*A)f|| > ||/]| whenever f € dom A*A. If h = (1 
+ A*A)f and B = (1 + A*A)7', then this implies that ||Bh|| < ||A||. Hence 
\|B|| < 1. In addition, (Bh, h) = (f,(1 + A*A)f) = |IfI|* + JAP]? = 0, so 
(b) holds. 

Put C = A(1 + A*A)~! = AB; if f € dom A*A and (1 + A*A)f = h, then 
|Ch||? = ||Afl|* < ||. + A*A)fI|* = |JA||? by the argument used to prove 
(a). Hence ||C]| < 1, so (c) is proved. 

Now to prove (e). Since A is closed, it suffices to show that no nonzero 
vector in gra A is orthogonal to {h ® Ah: h € dom A*A}. So let g € dom A 
and suppose that for every h in dom A*A, 

0 = (g ® Ag,h @ Ah) 
= (8,h) + (Ag, Ah) 
= (8,h) + (g, A*Ah) 
= (g,(1 + A*A)A). 

So g L ran(1 + A*A) = #; hence g = 0. 

To prove (d), note that (e) implies that dom A*4 is dense. Now let 
f, g © dom A*A; so f, g © dom A and Af, Ag € dom A*. Hence ¢ A*4f, g) 
= (Af, Ag) = (f, A*Ag). Thus A*4 is symmetric. Also, 1 + A*A has a 
bounded inverse. This implies two things. First, 1 + A*A is closed, and so 
A*A is closed. Also, —1 €¢ 0(A*A) so that by Corollary 2.10, A*A is 
self-adjoint. S 


4.3. Proposition. Jf N is a normal operator, then dom N = dom N* and 
|/I| = || *f]| for every fin dom N. 


PROOF. First observe that if h @€ dom N*N = dom NN*, then Nhe 
dom N* and N*h € dom N. Hence ||Nh||? = (N*Nh, hy = (NN*h,h) = 
\|V *h||*. Now if f € dom N, (4.2e) implies that there is a sequence {h, } in 
dom N*N such that h, ® Nh, > f ® Nf; so ||Nh, — Nfl| > 0. But from 
the first part of this proof, ||N*h, — N*h, || = \|Nh, — Nh,,||. So there is a 
gin # such that N*h, — g. Thush, @ N*h, > f @ g. But N* is closed; 
thus fedomN* and g=N*f. So domNCdomN* and ||Nfl = 
lim||NA,| = lim||N *h, || = ||N*f]). 

On the other hand, N* is normal (Why?), and so dom N* C dom N ** 
= dom N. a 
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4.4. Lemma. Let #,, %,, ... be Hilbert spaces and let A, © B(#.,) for all 
n>1. If B= {(h,)€ OH: Lj ||A,h,||° < 00} and A is defined on 
H= @ KH, by A(h,) = (A,h,,) whenever (h,,) © GZ, then A © 6(#). A is 
a normal operator if and only if each A,, is normal. 


PROOF. Since #, C & for each n, F is dense in #. Clearly A is linear. If 
{h\\ Cc dom A and h“)@ Ah >h@g in #@ #, then for each n, 
h@A, AY +h, ® g,. Since A, is bounded, 4,h, = g,. Hence ¥,||Ah,]||? 
= Yllz,ll° = |lgll2 < 00; so h € dom A. Clearly Ah = g, 80 A € G(#). 

It is left to the reader to show that dom A* = {(h,,) € #: L%_,||A*A,]||? 
< oo} and A*(h,) = (A*h,,) when (h,,) € dom A*. From this the rest of 


the lemma easily follows. i) 


If (X,02) is a measurable space and # is a Hilbert space, recall the 
definition of a spectral measure F for (X, 2, 7) ([X.1.1). If h,k © #, let 
E,,, be the complex-valued measure given by E, ,(4) = (E(A)h, k) for 
each A in 2. 

Let ¢: X — C be an (2-measurable function and for each n let A, = {x 
eX: n-—1 < |o(x)| <n}. So x, ¢ is a bounded {2-measurable function. 
Put # = E(A,)#. Since Ux_,A, = X and the sets {A,} are pairwise 
disjoint, ®*~ #4, = #. If E,(4) = E(A 1 4,), £,, is a spectral measure 
for (X, 0, 4). Also, {¢dE,, is a normal operator on #%,. Define 


oe) 2 
4.5 a,=\h Ex: > [ fede, |E(4,)h < oo} 
n=1 
By Lemma 4.4, N,: #—> # given by 
4.6 Nh= > [ fede, )E(4,)h 
n=1 


for h in Y, is a normal operator. The operator N, is also denoted by 


N, = [odE. 


4.7. Theorem. If E is a spectral measure for (X,0, 4), ¢: X > C is an 
(Q-measurable function, and 9, and N, are defined as in (4.5) and (4.6), 
then: 


(a) DB, = {hEH#: f\o|? dE, , < 0}: 
(b) forhin B, andfin #,¢6¢€ L\(\E,, /\) with 


1/2 
4.8 fieidlz,. As if J ol? aE, . 


4.9 (| foae}n.s)= feats 
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Proor. Using the *-homomorphic properties associated with a spectral 
measure (IX.1.12), one obtains 


|[foat,] eon - ([feseat] (Jrxswaen 2 


=) lol dE, ,.- 


From here, (a) is immediate. 

Now let he Y,, fe #. By the Radon—Nikodym Theorem, there is an 
{2-measurable function u such that |u| = 1 and |E, | = uF, ,, where |E,, | 
is the variation for E, ,. Let $, = L4_1X 4,9; So $, is bounded as is ud,,. 
Thus 


fleldlEn | = fldludE, 5 
- ([ fieamae a.) 
< in| I aludé a) 


But 


[feat al = ([ fiewae}™.( flonieat |) 
= ({ fie.Pae a, 
= {\e,1° dE, 


< {lol dE, 
Combining this with the preceding inequality gives that /|9,|d|E, ;| < 
WIC ||? dE, ,,)'77 for all n. Letting n > 00 gives (4.8). Since , is bounded, 


(( fo, dE)h, f) = fo,dE,, If he@D, and fe#, then (4.8) and the 
Lebesgue Dominated Convergence Theorem imply that /¢, dE, +> [odE, f 


as n — oo. But 
[fo,ae|h = ( foae |e Ua,)) 
Ua, [ fae )a. 


Since E(U"_,4,) > E(X)=1 (SOT) as n> ow. ((fo,dE)h, f) > 
((fodE)h, f) as n > oo. This proves (4.9). a 


=E 
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Note that as a consequence of (4.7) dom N, and the definition of N, do 
not depend on the choice of the sets {4,} as would seem to be the case 
from (4.5) and (4.6). 


4.10. Theorem. Jf (X, {2) is a measurable space, # is a Hilbert space, and 
E is a spectral measure for (X,Q, 4), let ®(X,Q) be the algebra of all 
(2-measurable functions ¢: X—C and define p: ®(X,02) > ©(#) by 
e(>) = {odE. Then for o, in ®CX, &2): 


(a) p(>)* = p(¢); 

(b) p(y) D p(o) p(y) and dom(p(¢)p(Y)) = BN Dy 
(c) If p is bounded, p()p(y) = p(y) p(>) = p(oy); 
(d) p()*o() = p(|9]°). 


The proof of this theorem is left as an exercise. 


4.11. The Spectral Theorem. /f N is a normal operator on #, then there is a 
unique spectral measure E defined on the Borel subsets of © such that: 


(a) N = [zdE(z); 

(b) E(A)=0 ifANo(N)=0 

(c) if U is an open subset of C and UM o(N) #0, then E(U) # 0; 

(d) if A € B(#) such that AN C NA and AN* C N*A, then A( {@dE) C 
({¢dE)A for every Borel function © on C. 


Before launching into the proof, a few words motivating the proof are 
appropriate. Suppose a spectral measure F defined on the Borel subsets of 
C is given and let N = {zdE(z). It is not difficult to see that if 0 < a < b 
< oo and A is the annulus {z: a < |z| < b}, then 4% = E(A)#H= (he 
dom N: h€ dom WN” for all n and a”|h|| < ||N"A|| < b"\|Al|}. FH 1s a 
closed subspace of # that reduces N and N|#, is bounded. The idea 
behind the proof is to write C as the disjoint union of annuli {4 ;} such that 
for each A, there is a reducing subspace #%, for N with ‘N, = N Ran 
bounded, and, moreover, such that #= @, Hy, Once this is “done the 
Spectral Theorem for bounded normal operators can be applied to each N, 
and direct sums of these can be formed to obtain the spectral measure 
for N. 

So we would like to show that for the annulus {z: a < |z| < b}, {he 
dom N: h € dom N” for all n and a” ||Al|| < ||N“A|| < b"||Al|} is a reducing 
subspace for N. To facilitate this, we will use the operator B = (1 + N*N)7! 
which is a positive contraction (4.2). To understand what is done below note 
that z +> (1 + |z|*)~! maps C onto (0,1] and a < |z| < b if and only if 
J+a’*)'>(14 jz?) t > 467)! 


4.12. Lemma. If N is a normal operator, B = (1 + N*N)~', and C = N(I 
+ N*N)~', then BC = CB and (1+ N*N) IN CC. 
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PROOF. From (4.2), B and C are contractions and B > 0. It will first be 
shown that (1 + N*N)"'N C C; that is, BN C NB. If f € dom BN, then 
fedomN. Let g&@domN*N such that f= (1 + N*N)g = Bg. Then 
N*Ng € dom N; hence Ng € dom NN* = dom N*N. Thus Nf = Ng + 
NN *Ng = (1+ N*N)Ng. Therefore BNf = B(1 + N*N)Ng = Ng. But 
NBf = Ng, so BN = NB on dom N. Thus BN C NB. 

If he #, let fe domN*N such that h=(1+ N*N)f. So BCh= 
BNBh = BNf = NBf = NBBh = CBh. Hence BC= CB. & 


4.13. Lemma. With the same notation as in Lemma 4.12, if B = [{[tdP(t) is 
its spectral representation, 1 > 6 > 0, and A is a Borel subset of [6,1], then 
H,= P(A)#C dom N, #4, is invariant for both N and N*, and N\|H, is a 
bounded normal operator with ||N|H,\| < [1 — 6)/6]'”?. 


Proork. If h € %4, then because P(A) = x ,(B), ||Bh\|° = (B*P(A)h,h) = 
[,t? dP, , = 8°*||All’. So BlZ#4, is invertible and there is a g in %, such 
that h = Bg. But ran B = dom(1 + N*N) C domN. Hence h € dom N; 
that is, %, C dom N. 

Let h © #, and again let g © % such that h = Bg. Hence Nh = NBg 
= Cg. By Lemma 4.12, BC = CB; so by (IX.2.2), P(A)C = CP(A). Since 
geH,, Nh= Cg &€ #,. Note that if M=N* and B, =(1+ M*M)"', 
then B, = B. From the preceding argument N*#, = M#%, C H,. It easily 
follows that N|24, is normal. 

Finally, if h © 4%, then 


||NA||? = (N *Nh, h) 
= ({((N*N + 1)-1]A,A) 


= ee — 1) dP, ,(t) < AI? — 8)/6. 
Hence ||N|%A|| < (1 — 6)/6]'”. sm 


PROOF OF THE SPECTRAL THEOREM. Let B = (1 + N*N)-'and C = M(1 + 
N*N)~‘as in Lemma 4.12. Let B = {(tdP(t) be the spectral decomposi- 
tion of B and put P, = P(X. /(n + 1),1/n] for n = 1. Since ker B = (0) = 
P({O}), UP, = 1. Let 4 = PH. By Lemma 4.13, #4 CdomN, # 
reduces N, and N, = N|#%. is a bounded normal operator with ||N,|| < n'7°. 
Also, if h © #,, (1 + N*N,)Bh = BL + NN, )h = h; that is, 


Bit, = (1+ NAN,) 


Thus if A €o(N,), (1 + |Al?)~! © o( BI) C [1/(n + 1),1/n]. Thus 
o(N,)C{z EC: (n-1)'"% < |z| <n'7}=4,. Let N,= f[zdE,(z) be 
the spectral decomposition of N,. For any Borel subset A of C, let E(A) be 
defined by 


4.14 E(A) = y E(ANA,). 


n=1 
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Note that E,(4 1 A,) is a projection with range in %. Since #, 1 %, 
for n # m, (4.14) defines a projection in #(#). (Technically E(4) should 
be defined by E(A) = 1°_, E,(A NA,)P,. But this technicality does not 
add anything to understanding.) 

To show that E is a spectral measure, it is clear that E(C) = 1 and 
E(Q) = 0. If A, and A, are Borel subsets of C, then 


E(A, ‘ A;) a > E,(A, mM A, M” A,) 


n=1 


= 2 E,(A, -” A, JE,(A, (" A,,). 
n=l 


Again, the fact that the spaces { % } are pairwise orthogonal implies 


E(A, MO A,) = | x E,(A, M A,) 


n=1 


| E,(A,N4,) 
n=1 


= E(A,)E(A,). 


If h € #, then (E(A)h,h) = UP_ CE, (AN A, )h, h). Soif {A, }2) are 
pairwise disjoint Borel sets, 
£ h,h 
1 


(| UA,)han ae 


x x (E,(A, ™ A,,)h,h). 


n=1 j=1 
Since each term in this double summation is positive, the order of summa- 
tion can be reversed. Thus 


A; 


ae 


ak: 


NA, 


So E(US_,A,) = U2, E(A,); therefore E is a spectral measure. 

Let M = {zdE(z) be defined as in Theorem 4.7. Thus 7%, C dom M and 
by the Spectral Theorem for bounded operators, Mh = Nh = Nhifh © #,. 
If A is any vector in dom M, h = X&h,, h, € %, and L®||NA,||* < 00. 
Thus h € dom N and Nh = Mh. This proves (a). 


4.15. Claim. o(N)= af U o(,) 


n=1 


It is left to the reader to show that U%_,o(N,) € o(N). Since o(N) is 
closed, this proves half of (4.15). If A € cl[U%_,o(N,,)], then there is a 6 > 0 
such that |A — z| > 6 for all z in U%_,o(N,). Thus (N, — A)~* exists and 


n=1 


X.4. Unbound Normal Operators and the Spectral Theorem 333 


WON, — Ao" < 67! for all n. Thus A = ®°,(N, —A)~! is a bounded 
operator. It follows that A =(N —A)71, so A € o(N). 

By (4.15) if AN o(N) =O, 4N o(N,) =O for all n. Thus FE, (A) = 0 
for all n. Hence E(A) = 0 and (b) holds. 

If U is open and UN o(N) # O, then (4.15) implies UN o(N,,) # O for 
some n. Since E,(U) # 0, E(U) # 0 and (c) 1s true. 

Now let A © @(#%) such that AN C NA and AN* C N*A. Thus A(1 + 
N*N)C (1 + N*N)A. It follows that AB = BA. By the Spectral Theorem 
for bounded operators, A commutes with the spectral projections of B. In 
particular, each #, reduces A and if A, = A|#%, then A,N, = N,A,. 
Hence A,o(N,,) = $(N,,)A,, for any bounded Borel function ¢. The remain- 
ing details are left to the reader. i 


The Fuglede—Putnam Theorem holds for unbounded normal operators 
(Exercise 8), so that the hypothesis in part (d) of the Spectral Theorem can 
be weakened to AN C NA. 


4.16. Definition. If N is a normal operator on #, then a vector e, is a 
star-cyclic vector for N if for all positive integers k and /, e, € dom( N *N! ) 
and #= V{N**N’e,: k,1 > 0}. 


4.17. Example. Let p» be a finite measure on C such that every polynomial 
in z and z belongs to L*(p). Let 9, = {f © L*(p): zf © L?(w)} and define 
Nf = 2f for f in ,. Then N, is a normal operator and 1 is a star-cyclic 
vector for N,. 


Note that du(z) = e |?!d Area(z) is a measure satisfying the conditions 
of (4.17). 


4.18. Theorem. Jf N is a normal operator on # with a star-cyclic vector ey, 
then there is a finite measure p on C such that every polynomial in z and Zz 
belongs to L?(u) and there is an isomorphism W: #?— L*(w) such that 
We, = 1 and WNW ' = N.. 


The proof is similar to the proof of Theorem IX.3.4 and is left to the 
reader. 


4.19. Theorem. Jf N is a normal operator on the separable Hilbert space #, 
then there is a o-finite measure space ( X, 92, w) and an (-measurable function 
@ such that N is unitarily equivalent to M, on L7(p). 


The proof of Theorem 4.19 is only sketched. Write N as the (unbounded) 
direct sum of bounded normal operators { N,,}. By Theorem IX.4.6, there is 
a o-finite measure space (X,, 2, u,,) and a bounded (2,-measurable func- 
tion ¢, such that N, = M,. Let X = the disjoint union of { X,,} and let 
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2={AcKX: AN X, € OQ, for every n}. If A € Q, let w(A) = Lp, (An 
X,,). Let @: X —>C be defined by $(x) = $,(x) 1f x © X,. Then @ 1s 
(Q-measurable and N = M, on L*(X, 2, p). 


EXERCISES 


1. Prove Theorem 4.10. 


NO 


. Show that if A is a symmetric operator that is normal, then 4 is self-adjoint. 


3. Using the notation of Theorem 4.10, what is o( {¢ dE)? 


= 


. If A, and E, are as in the proof of the Spectral Theorem, show that £,,(4,,, ;) 
I E, (A,,-1) = 0. 


5. Use the Spectral Theorem to show thatif0 <a<b<o,A={zEC:a< |z2| 
< b}, and N = {zdE(z) is the spectral decomposition of the normal operator 
N, then E(A)#= {hE domN: a” |Al| < ||NA|| < b"||Al| for all n > 1}. 


6. State and prove a polar decomposition for operators in (#, #). 
7. If A is self-adjoint, prove that exp(iA) is unitary. 


8. (Fuglede—Putnam Theorem.) If NV, M are normal operators and A is a bounded 
operator such that AN C MA, then AN* C M*A. 


9. Prove Theorem 4.18. 


10. If ,, > are finite measures on C and N, , N,, are defined as in Example 4.17, 
show that N, = N,, iff [4] = [u2]. 


11. Fill in the details of the proof of Theorem 4.19. 


§5. Stone’s Theorem 


If A is a self-adjoint operator on #, then exp(iA) is a unitary operator 
(Exercise 4.7). Hence U(t) = exp(itA) is unitary for all ¢ in R. The purpose 
of this section is not to investigate the individual operators exp(itA), but 
rather the entire collection of operators {exp(itA): t © R}. In fact, as the 
first theorem shows, U: R — unitaries on # is a group homomorphism 
with certain properties. Stone’s Theorem provides a converse to this; every 
such homomorphism arises in this way. 


5.1. Theorem. /f A is self-adjoint and U(t) = exp(itA) for t in R, then 
(a) U(t) is unitary; 

(b) U(s + t) = U(s)U(t) for all s in R; 

(c) ifh © #, then lim, _,,U(s)h = U(t)h; 

(d) ifh € dom A, then 


5.2 lim —[U(t)A ~ A] = iAh: 
t- 


ST 


(e) ifh © # and lim, _,,t~'[U(t)h — h] exists, then h © dom A. 
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PRooF. As was mentioned, part (a) 1s an exercise. Since exp(itx )exp(isx) = 
exp(i(s + ¢)x) for all x in R, (b) is a consequence of the functional calculus 
for normal operators [(4.10) and (4.11)]. Also note that U(O)U(t) = U(t), so 
that U(O) = 1. 

(c) If h € #, then ||U(t)h — U(s)h|| = ||U(t — s + s)h — U(s)hl\| = [by 
(b)] || UCs [Ut — s)h — All| = || UG — s)h — hl] since U(s) is unitary. Thus 
(c) will be shown if it is proved that ||U(t)h —h|| ~0 as t- 0. If 
A = {*”,,xdE(x) is the spectral decomposition of A, then 


|U(A)A— Ay? = fle" — 1P dE, (2). 


Now E, , is a finite measure on R; for each x in R, |e* — 1|* > 0 as 
t—>0; and je”*—1|* <4. So the Lebesgue Dominated Convergence 
Theorem implies that U(t)h > h as t > 0. 

(d) Note that ¢~'[U(t) — 1] — iA = f,(A), where f(x) = t7 "[exp(itx) — 
1] — ix. So if h € dom A, 


|Flu( —h] — iAh | = |If,CA)All’ 


- 
As t > 0, t-"[e* — 1] — ix > 0 for all x inR. Also, |e’ — 1| < |s| for all 
real numbers s (Why?), hence |f,(x)| < |t|~*]e’* — 1] + |x| < 2|x|. But 
|x| € L*(E, ,) by Theorem 4.7(a). So again the Lebesgue Dominated 
Convergence Theorem implies that (5.2) is true. 
(ec) Let D= {hE H#: lim,_, 9t7'[U(t)h — h] exists in #)}. For h in J, 
let Bh be defined by 


eitx ae | 
t 


2 
— ix| dE, ,(x). 


Bh = —ilim UU)h=h 
t-0 t 


It is easy to see that Z is a linear manifold in # and B is linear on Q. 
Also, by (d), B 2A so that B is densely defined. Moreover, if h,g € J, 
then 


(Bh, g) = tie s). 
t- 


By (b) and the fact that each U(t) is unitary, it follows that U(t)* = U(t)~! 


= U(-t). Hence 
~itim ( Cees 2 


Bh, 
(Bh, g) lim ; 


im (a, ~ 1] Se 8/) 


= ¢h, Bg). 
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Hence B is a symmetric extension of A. Since self-adjoint operators are 
maximal symmetric operators (2.11), B = A and Y= dom A. a 


Inspired by the preceding theorem, the following definition is made. 


5.3. Definition. A strongly continuous one-parameter unitary group 1S a 
function U: R > @(#) such that for all s and ¢ in R (a) U(‘) is a unitary 
operator; (b) U(s + t) = U(s)U(t); (c) if h © #, then U(t)h > U(ty)h as 


bts, 


Note that by Theorem 5.1, if A is self-adjoint, then U(t) = exp(itA) 
defines a strongly continuous one-parameter unitary group. 

Also, U(0) = 1 and U(—t) = U(t)~', so that {U(t): t © R} is indeed a 
group. Property (c) also implies that U: R > (4(#), SOT) is continuous. 
By Exercise 1, if U is only assumed to be WOT-continuous, then U 1s 
SOT-continuous. However, this condition can be relaxed even further as the 
following result of von Neumann [1932] shows. 


5.4. Theorem. If # is separable, U: R > &(#) satisfies conditions (a) 
and (b) of Definition 5.3, and if for allh, gin # the function t > (U(t)h, g) 
is Lebesgue measurable, then U is a strongly continuous one-parameter unitary 
group. 


PRooF. If 0 < a< oo and h,g&#, then t> (U(t)h, g) 1s a bounded 
measurable function on [0, a] and hence 


J \U(t)h, g) | dt s allaliisl 
Thus 
he» [-(U(t)h, g) dt 
0 
is a bounded linear function on #. Therefore there is a g, in # such that 
5.5 (A, 84) = [ (U(t)h, 8) de 
and ||g,l| < allgll. 


Claim. {g,: g © #, a > 0} 1s total in #%. 


In fact, suppose h © # andh 1 {g,: g © #, a > 0}. Then by (5.5), for 
every a > 0 and every g in #, 


0 = [U()h, 8) a. 


Thus for every g in #, (U(t)h, g) = 0 ae. on R. Because # is separable 
there is a subset A of R having measure zero such that if ¢ € A, (U(t)h, g) 
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= Q whenever g belongs to a preselected countable dense subset of #%. 
Thus U(t)h = 0 if t € A. But |/h|| = ||]U(t)h||, so h = O and the claim is 
established. 

Now if s € R, 


(h,U(s)g,) = (U(-S)h, 8.) 


= [uC — s)h, g) dt 


={  (U(t)h,g) dt. 


Thus ¢h, U(s)g,) > (h, g,) as s > 0. By the claim and the fact that the 
group is uniformly bounded, U: R — (4(#), WOT) is continuous at 0. By 
the group property, U: R > (@(#), WOT) is continuous. Hence U is 
SOT-continuous (Exercise 1). a 


We now turn our attention to the principal result of this section, Stone’s 
Theorem, which states that the converse of Theorem 5.1 is valid. Note that 
if U(t) = exp(itA) for a self-adjoint operator A, then part (d) of Theorem 
5.1 instructs us how to recapture A. This is the route followed in the proof 
of Stone’s Theorem, proved in Stone [1932]. 


5.6. Stone’s Theorem. Jf U is a strongly continuous one-parameter unitary 
group, then there is a self-adjoint operator A such that U(t) = exp(itA). 


PROOF. Begin by defining 9 to be the set of all vectors h in # such that 
lim,_.9t ‘[U(t)h — h] exists; since 0 € J, D#U. Clearly F is a linear 
manifold in #. 


5.7. Claim. @ is dense in #. 

Let Y= all continuous functions @ on R such that @ € L'(0, 00). Hence 
for any h in #, t $(t)U(t)h is a continuous function of R into #. 
Because ||U(¢)A|| = ||A|| for all ¢, a Riemann integral, /[°@(t)U(t)h dt. can 


be defined and is a vector in #. Put 


5.8 T,h = [ou har 


It is easy to see that T,: #°—> # is linear and bounded with ||7,]| < 
{5° \o(t)| dt. Similarly, for each ¢@ in 


5.9 S,h = [9()u(-hd 


defines a bounded operator on #%. 
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For any ¢ in # and ¢ inR, 
U(t)T,h = U(t) [6(s)U(s)has 
= [9(s)U( + s)hds 
= fo ols- 1)U(s)hds. 


Similarly, 


U(t)S,h = [4s + t)U(-—s)hds. 


Now let #“ = all @ in & that are continuously differentiable with ¢’ in 
L. For ¢ in Y, 


= -[U(t) —1]T,h = - rf ls — t)U(s)hds + rf o(s)U(s)hds 
= ~if*[ PE 9) lasynas 


t 
1 ft 
+ 7 | o(s)U(s)hds. 
Now 


pes — t) — 9(s) |u(s)nas 


t 


< ||A|lsup{|o(s — t) — o(s)|:0< 5 <1} 
— 0 
as t — 0. Hence 
ae (* EAP) o(s)n a = ~ fo o(s)U(s)has 


= Tih. 


Since s > $(s)U(s)h is continuous and U(Q) = 1, the Fundamental Theo- 
rem of Calculus implies that 


lim ~ ['6(s)U(s)hds = H(O)h. 
r>0 f vo 
Hence for ¢ in Y“ and hin #, 
5.10 lim — —[U(t) —1]T,h = iTyh + i6(0)A. 
t—>0 
Similarly, for ¢ in ZY“ and h in #, 


5.11 lim — ~[U(t) —1]S,h = —iSyh — ip(0)h. 
t- 
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So (5.10) implies that 
D2{T,h: 6 EL andhe x}. 
But for every positive integer n there is a ¢, in Y such that ¢, > 0, 
¢, (t) = 0 for t= 1/n, and {§°¢,(t) dt = 1 (Exercise 2). Hence 
l/n 
T,h —h =| ¢,(t)[U(t) — 1] hat 


and so ||T,h — All < sup{||U(t)A — hij: 0 <t < 1/n}. Therefore ||7, h — 
h|| > 0asn — o since U is strongly continuous. This says that J is dense. 
For h in Q, define 


5.12 Ah = ~itim =[U(t) - 1h. 
t—0 


5.13. Claim. A is symmetric. 


The proof of this is left to the reader. 

By (2.2c), A is closable; also denote the closure of A by A. According to 
Corollary 2.9, to prove that A is self-adjoint it suffices to prove that 
ker( A* + 7) = (0). Equivalently, it suffices to show that ran(A +7) Is 
dense. It will be shown that there are operators B , such that(A + 1)B,= 1, 
so that A + i is surjective. 

Notice that according to (5.10), 

(A +i)T, = AT, + iT, = i(Ty + T,) + i¢(0). 
So taking $(t) = —ie™‘', (A + i)T, = 1. According to (5.11), 
(A — i)S, = AS, — iS, = —i(Sy + S,) — i (0). 
Taking y(t) = ie‘, (A — i)S, = 1. Hence A is self-adjoint. 
Put V(t) = exp(iAr). It remains to show that V = U. Let he J. By 
Theorem 5.1(d), 
s"lV(tts)-V(t)]h =s3[V(s) —1]V(t)h > iAV(t)h; 
that is, V’(t)h = iAV(t)h. Similarly, 
s'|U(t+s)- U(t)|h =s7[U(s) —1]U(t)h > iAU(t)h. 
So if A(t) = U(t)h — Vi(t)h, then h: R — 5 1s differentiable and 
h(t) = iAU(t)h — iAV(t)h = iAh(t). 
But 


Met HO g¢045) nin), 


1 

slle(e + 8) — ACOIP = ( 
Thus (d/dt)||h(t)||? = 0 and so |jA||: R > R is a constant function. But 
h(O) = 0, so h(t) = 0. This says that U(t)h = V(t)h for all h in @ and all ¢ 
in R. Since J is dense, U = V. a 
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5.14. Definition. If U is a strongly continuous one-parameter unitary 
group, then the self-adjoint operator A such that U(t) = exp(itA) is called 
the infinitesimal generator of U. 


By virtue of Stone’s Theorem and Theorem 5.1, there is a one-to-one 
correspondence between self-adjoint operators and strongly continuous 
one-parameter unitary groups. Thus, it should be able to characterize 
certain properties of a group in terms of its infinitesimal generator and vice 
versa. For example, suppose the infinitesimal generator is bounded; what 
can be said about the group? 


5.15. Proposition. Jf U is a strongly continuous one-parameter unitary group 
with infinitesimal generator A, then A is bounded if and only if lim,_, || U(¢) 
— 1|| = 0. 


PROOF. First assume that A is bounded. Hence ||U(t) — 1|| = ||exp(itA) — 
1|| = sup{|e’* — 1]: x € o(A)} > 0 as ¢ = 0 since o( A) is compact. 

Now assume that ||U(t) — 1|| ~ 0 as t ~ 0. Let 0 < € < 7/4; then there 
is a t,>O such that ||U(t)—1|| <e for |t| <7). Since U(t)-1= 
faca(e’ — 1) dE(t), sup{le”! — 1[: x € 0(A)} = ||U(t) — 1] < € for Id 
< t . Thus for a small 6, tx © U%__  (2an — 6,2an + 6) = G whenever 
x € o(A) and |t| < ¢). In fact, if e is chosen sufficiently small, then 6 is 
small enough that the intervals {(2a7n — 6,2an + 6)} are the components 
of G. If x € o(A), {tx: 0 <t < t)} is the interval from 0 to ¢)x and is 
contained in G. Hence tx € (—6,8) for x in o(A) and |t| < ¢,. In particu- 
lar, t,0(A) C [—4, 6] so o( A) is compact and A is bounded. a 


Let p be a positive measure on R and let A, f= xf for fin 9, = {fe 
L?(p): xf € L?()}. We have already seen that A is self-adjoint. Clearly 
exp(itA,) = M, on L*(R), where e, is the function e,(x) = exp(itx). This 
can be generalized a bit. 


5.16. Proposition. Let (X, 0,1) be a o-finite measure space and let o be a 
real-valued §2-measurable function on X. If A = M, on L?(n) and U(t) = 
exp(itA), then U(t) = M,, where e,(x) = exp(ito(x)). 


Since each self-adjoint operator on a separable Hilbert space can be 
represented as a multiplication operator (Theorem 4.19), the preceding 
proposition gives a representation of all strongly continuous one-parameter 
semigroups. 


EXERCISES 
1. If U: R — &(#) is such that U(?t) is unitary for all ¢, U(s + t) = U(s)U(t) for 
all s, t, and U: R > (@(#), WOT) is continuous, then U is SOT-continuous. 


2. Show that for every integer n there is a continuously differentiable function 9, 
such that both ¢, and ¢ € L'(0, 0), $,(t) = 0 if > 1/n, and {5°¢,(¢) dt = 1. 
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3. Prove Claim 5.13. 


4. Adopt the notation from the proof of Stone’s Theorem. Let ¢,~ € & and show 
(a) T,* = Sj; (b) T,T, = T,,, and S,S, = Sy; (©) T,4 © AT. 


5. Let U be a strongly continuous one-parameter unitary group with infinitesimal 
generator A. Suppose e is a nonzero vector in # such that Ae = Ae. What is 
U(t)e? Conversely, suppose there is a nonzero ¢ such that U(t) has an eigenvec- 
tor. What can be said about A? U(s)? 


§6. The Fourier Transform and Differentiation 
Perhaps the best way to begin this section is by examining an example. 


6.1. Example. Let 9 = { f © L?(R): f is absolutely continuous on every 
bounded interval in R and f’ € L7(R)}. For f in J, let Af = if’. Then A 
is self-adjoint. | 

First let’s show that A is symmetric. If f€ Y, note that f(x) > 0 as 
x > +00 since f € L*(R). Soif f,g € 9,0 <a< a, 


if f(x)a@) ae = iL f(a)g(a)- f(-a)g(—a)] ~ if" f(x)8'(x) ax. 


Hence (Af, g) = (f, Ag) and A is symmetric. 

Now let g © dom A®* and for 0 < a < o let 9,={fEQ: f(x) = 0 for 
|x| = a}. The proof that g © dom A follows the lines of the argument used 
in Example 1.11. In fact, let h = A*g. So if fe B, then {f(x)h(x) dx = 
iff (x)g(x) dx. Let H(x) = f{ih(t) dt. Then using integration by parts we 
get that for f in @,, 


[° fh=H{a)f(a) -H(-a)f(~a) - ff 


fsa 


Therefore [¢,f’[H — (ig)] = 0 for every f in @,. As in (1.11), it follows 
that H — ig is constant on [—a,a] and g is absolutely continuous. More- 
over, 0 = H’ — ig’ =h — ig’; hence A*g = h = ig’. Thus g € J and A is 
self-adjoint. 

If A is the differentiation operator in Example 6.1, what is the group 
U(t) = exp(itA)? Since A is not represented as a multiplication operator, 
Proposition 5.16 cannot be applied. One could proceed to try and discover 
the spectral measure for A. Since A = {xdE(x), U(t) = fe'* dE(x). Or 
one could be clever. 

Later in this section it will be shown that if F: L7(R) —> L7(R) is the 
Fourier—Plancherel transform, then ¥ is a unitary operator (6.17) and 
ZF 'AF= the operator on L?(R) of multiplication by the independent 
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variable (6.18). Thus ¥ ‘U(t)F¥ is multiplication by e’*’. But it is possible 
to find U(t) directly. 
Recall that if f © dom A, 
Af = —ilim UG)f—f 
t-0 l 


So 


_— (U(t) f(x) = f(x) 
‘uae Se i 


Being clever, one might guess that (U(t)f)(x) = f(x — f). 


6.2. Theorem. /f A and @ are as in Example 6.1 and U(t) = exp(itA), then 
(U(t)f (x) = f(x — t) for all f in L?(R) and x,t in R. 


PROOF. Let (V(t) f)(x) = f(x — t). It is easy to see that V is a strongly 
continuous one-parameter unitary group. Let B be the infinitesimal genera- 
tor of V. It must be shown that B = A. 

Note that f © dom B if and only if lim,_,)¢ \(V(t)f — f) exists. Let 
f <= C(R); that is, f is continuously differentiable and has compact 
support. Thus for ¢ > 0, 


POP f) 2) ee rf £0) dy 


t 


and 


V(t) f(x) — fx) 
t 


Hoist f O)-f£oe 

< sup{|f’(x) — f(y) le — yl St}. 
Because f’ is continuous with compact support, f’ is uniformly continu- 
ous. Let K = {x: dist(x,spt f’) < 2}. So K is compact. For e > 0, let 
d(€) < 1 be such that if |x — y| < 6(e), then |[f(x) —f’(y)| < «. Hence 
We [VF — f+ fll, < e?|K], where |K| = the Lebesgue measure of K. Thus 
C(R) C dom B and Bf = Af for f in C{(R). But if f € dom A, there is a 
sequence { f,} ¢ C{(R) such that f, ® Af, > f ® Af in gra A (Exercise 1). 
But f, ® Af, =f, ® Bf, © gra B, so f ® Af © gra B; that is, A C B. Since 
self-adjoint operators are maximal symmetric operators (2.11), A = B. 
a 


To show that the Fourier transform demonstrates that M, and id/dx are 
unitarily equivalent, we introduce the Schwartz space of rapidly decreasing 
functions. 


6.3. Definition. A function ¢: R —> R is rapidly decreasing if ¢ 1s infinitely 
differentiable and for all integers m,n > 0, 


6.4 ISllnin = Sup{x 9 (x)]: x ER} < 00. 
Let “= /(R) be the set of all rapidly decreasing functions on R. 
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Note that if @¢ © /, then for all m,n > 0 there is a constant C,, , such 
that 


lo(x) Ss Cy nlx]. 


Thus if p is any polynomial and n > 0, |p(x)o“(x)| > 0 as |x| > 00. In 
fact, this is equivalent to ¢ belonging to Y (Exercise 3). Also note that if 
o © Y, then xo € F for all m,n > 0. 

It is not difficult to see that || - ||,,., is a seminorm on . Also, Y with 
all of these seminorms is a Frechet space (Exercise 2). The space / is 
sometimes called the Schwartz space after Laurent Schwartz. 


6.5. Proposition. /f 1 < p < 0, “SC L?(R). If 1 < p < oo, F is dense in 
L?(R); SY is weak-star dense in L™(R). 


PROOF. We already have that “C L™(R). If 1 < p < 00 and ¢ E FY, then 


fo lorax = fo (1+ x7) 71 + x?) 9)? dx 


< [I + x?) "o/h, f+ x2)? de. 
Since (1 + x7)? > 14 x’, 
Io, <o/(1 + x?) Ola 


1 
<7 /PTIlbllo.o + Pll2,0]. 


Since C‘~)(R) CY, the density statements are immediate. JM 


6.6. Definition. If f € L\(R), the Fourier transform of f is the function f 
defined by 


x 1 
f(x) = — | f(t)e"*' dt. 
= Fe AL 
Because f € L(R), this integral is well defined. 
The interested reader may want to peruse §VII.9, where the Fourier 
transform is presented in the more general context of locally compact 


abelian groups. That section will not be assumed here. 
Recall that if f, g € L’, then the convolution of f and g, 


f*e(x)=(20) 7 iE f(x — t)g(t) dt, 


belongs to L'(R) and ||f * gil, < |[fll,||gi|,. It is also true that if fe L?(R), 
l1<p so, then f*geL?(R) and |{f* gil, < lIfllpllgll, (see Exercise 4). 
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6.7. Theorem. (a) If f e L\(R), then f is a continuous function on R that 
vanishes at +00. Also, ||fllo < |lflh- 
(b) IfoES, bE YF. Also form,n > 0, 


68 (ix)"(2)"6 = [(2)"(-ix)"0)] 


(c) If f,g € L\(R), then (f *g) = fe. 
(Note: [ ] = the Fourier transform of the function defined in the brackets.) 


Proor. (a) The fact that f is continuous is an easy consequence of 
Lebesgue’s Dominated Convergence Theorem; it is clear that || floc =e | Ale 
For the other part of (a), let f = the characteristic function of the interval 
(a,b). Then f(x) = i(27)7 1x7 ![e7*’ — e7*4] > Oas |x| > 00. So f(x) 
vanishes at + 00 if f is a linear combination of such characteristic functions. 
The result for a general f follows by approximation. 

(b) It is convenient to introduce the notation Dé = ¢’. Thus D6 = @"””. 
Also in this proof, as in many others of this section, x will be used to 
denote the function whose value at ¢ is ¢ and it will also be used 
occasionally to denote the independent variable. 

If ¢€Y, then differentiation under the integral sign (Why is this 
justified?) gives 


(D8) 9) = =f (ine "(0 at 
= [(-ix)@] (>). 


By induction we get that for n > 0, 
(6.9) — D'% = [(-ix)"9]. 


Using integration by parts, 


(D$)"(y) = == ae ie e g(t) d 
a [i 9) Slew da 


e 'o(t) dt. 


That is, (D@) = (ix). By induction, 
6.10 (Dp) = (ix)"@ 
for all n => 0. Combining (6.9) and (6.10) gives (6.8). 
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By (6.8) if m,n > 0, then for ¢ in Y, 
ISlln.n = Sup{lx”(D"})(x)|: x © R} 


supl |= fe F)[-in*o(e)] a 
<del [gy eve 


< 
since D(x") € L'(R) (6.5). 
(c) This is an easy exercise in integration theory and is left to the reader. 
if 


ER 


The fact that f(x) —0 as |x| > co is called the Riemann—Lebesgue 
Lemma. 

The process now begins whereby it will be shown that the Fourier 
transform on L! \ L’ extends to a unitary operator on L*(R). Moreover, 
the adjoint of this unitary will be calculated and it will be shown that if 
id/dx is conjugated by this unitary, then the resulting self-adjoint operator 
is M,. 

Changing notation a little, let U, [instead of U(y)] denote the translation 
operator. Moreover, think of U, as operating on all of the L”? spaces, not 
just L*, so (U,f)(x) = f(x — y) for f in L?(R). Also, let e, be the function 
e, (x) = exp(ixy). 


6.11. Proposition. Jf f € L\(R) and y € R, then 
LGA] =e yf, 
lef] = UF, 
ProoF. If f € L'(R), 
[Uf] (x) = 22)? [UF] (de ar 
= (20) [f(t ye“ at 


= (20)? ff(s)e E+” ds 
=e_,(x)f(x). 


The proof of the other equation is left as an exercise. a 


In the proof of the next lemma the fact that /{ ee dt = ym is needed. 
Those who have never seen this can verify it by putting J = (oe dx, 
. 2 2 . . 
noting that [* = [</S°e~ “+” dx dy, and using polar coordinates. 
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6.12. Lemma. If ¢ > 0 and p,(t) = e~*", then 
1 ya 
Sey a /4e 
ev2 
Proor. Note that p, Ee S. By (6.8), Dp, = (—ixp,) . Using integration by 
parts, 


p.(x) = 


—] 0° 22 
Do.) (*) = === ee a 
(Dp,)(x) = == J 


ra —] oe aoe er 
-E lB) ede) 
ss se i? e © (—ix)e"™ dt 


2e*V2a 
xX , 
= x). 
52 p(x) 
Let y,(x) = e~*' 4". Then both 6, and y, satisfy the differential equation 
u(x) = —(x/2e7)u(x). Hence p, = cw, for some constant c. But W,(0) = 1, 


and 


(0) == fe Pat 


1 
a ae © 
6.13. Proposition. Jf y € L'(R) such that (27)7'/*{_w(x)dx = 1 and if 
for e > 0, p(x) =e W(x/e), then for every f in C)(R), ¥.* f(x) > f(x) 


uniformly on R. 


ProoF. Note that (27)7!/*/W,(x) dx =1 for all e > 0. Hence for any x 
in R, 
[ 


ber f(x) = f(x) = 20)? [L(x - 9) -s0)] -o(E) a 


= (20) 1? [[ f(x — se) — f(x)]¥(s) ds. 


Put w(y) = sup{|f(x — y) — f(x)|: y © R}. Now f is uniformly continu- 
ous (Why?), so if « > 0, then there is a 6 > O such that w(y) < e if |y| < 6. 
Thus w(y) > 0 as |y| — 0. Moreover, the inequality above implies 


Me ffl < Qn) "7? fao(se)y(s)] ds, 


Since y © L'(R), the Lebesgue Dominated Convergence Theorem implies 
that ||, *f— fll, ~ 9 whenever e, > 0. This proves the proposition. 
a 
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The next result is often called the Multiplication Formula. Remember that 
if fe L\(R), f © C,(IR). Hence fg € L'(IR) when both f and g € L'(R). 


6.14. Theorem. If f, g © L'(R), then 
[ F(x) g(x) dx = f f(~)8(2) ax. 
R R 


PROOF. The proof is an easy consequence of Fubini’s Theorem. In fact, if 
f,g € L'(R), then 


[Flo)alx)ax = f| AL frie vata) a 


= [rl0)| Fe fatxyeras| a 
=ff(:)at)ar. 


6.15. Inversion Formula. [f ¢ € Y, then 


(x) = o(r)e™ de. 


1 ore) 
i! 
PRroor. Let p,(x) =e ** and put W(x) = p,(x). Then by Lemma 6.12 
V(x) = e )(x/e) = 0,(x). Also, 


(20) 7"? fy(x) dx = (2m)? f" 2-M2—-2°/4 dy = 1, 
So ~,* h(x) — A(x) uniformly for any h in C,(R). If 6 EY, put f=¢ 


and g = e,p, in (6.14). By Proposition 6.11 and Lemma 6.12, ¢ = U.p, = 
Uy .. Thus 


1 Pre itx,,— et? 1 ee 
eI oe e d= J o(t)y (1 x) de 
= $*y,(x) 
> $(x) 


as e > 0. The Lebesgue Dominated Convergence Theorem implies the 
left-hand side converges to (27) '/*/@(t)e’*' dt and the theorem is proved. 
a 


In many ways the next result is a rephrasing of the preceding theorem. 
6.16. Theorem. If ¥: L— F is defined by Fo = $, F isa bijection with 


(F-'9)(x) = o(t)e dt. 


] 00 
ie ba 
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Moreover, if S is given the iopoleay’ induced by the seminorms {|| - ||,,. n° 
m,n > 0} that were defined in (6.4), F is a homeomorphism. 


PROOF. By (6.7b), AY C SY. The preceding theorem says that ¥ is bijective 
and gives the formula for ¥ '. The proof of the topological statement is left 
to the reader. ia 


6.17. Plancherel’s Theorem. [f ¢ € Y, then ||¢||, = ||¢||, and the Fourier 
transform ¥ extends to a unitary operator on L?(R). 


Proor. Let ¢ € ¥ and put v(x) = ¢(—x)..So p=o*VE ER) and 
6 = ow. An easy calculation shows that J = $; hence p = \o|?. Also, the 
Inversion Formula shows that p(0) = (27) '/*/6(x) dx = (27)7'” 
{|o(x)|? dx. Thus 


fidC)P de = (27)'0(0) 
= (20)'"o* ¥(0) 
= [o(x)¥ 0 - x) ax 


= flo(x)P ax. 


So if Y is considered as a subspace of L?(R), F, the Fourier transform, is 
an isometry on #. By Proposition 6.5 and the preceding theorem, ¥* 
extends to a unitary operator on L*(R). a 


Warning! The content of the Plancherel Theorem is that the Fourier 
transform extends to an isometry. The formula for this isometry is not given 
by the formula for the Fourier transform. Indeed, this formula does not 
make sense when f is not an L! function. However, the same symbol, # 
will be used to denote this unitary operator on L?(R). For emphasis it is 
called the Plancherel transform. 


6.18. Theorem. Let A be the operator on L*(R) given by Af = id/dx and let 
M be the operator defined by Mf=xf. If F: L7(R)—> L*(R) is the 
Plancherel Transform, then Adom M = dom A and 


FUAF= M. 


Proor. The fact that AA= AM on YF is an immediate consequence of 
Theorem 6.7(b). Since Y is dense in both dom A and dom M, the rest of the 
result follows (with some work— give the details). a 


Fourier analysis is a subject unto itself. One source is Sten and Weiss 
[1971]; another is Reed and Simon [1975]. 


X.7. Moments 349 


EXERCISES 


1. If D is as in Example 6.1, show that for every f in Z there is a sequence { f, } in 
CIR) such that f, > f and f’ > f’ in L?(R). 


2. Show that the Schwartz space “ with the seminorms {|| - || 
Frechet space. 


m,n>O}isa 


min 


3. If is infinitely differentiable on R, show that ¢ © ¥ if and only if for every 
integer n > 0 and every polynomial p, $'")(x) p(x) > 0 as |x| > oo. 


4. If fe L?(R),1 < p < «, and g € L'(R), show that f * g © L?(R) and ||f « gil, 
< |Ifllpllgil,. (Hint: See Dunford and Schwartz [1958], p. 530, Exercise 13 for a 
generalization of Minkowski’s Inequality.) 


5. If y and y, are as in Proposition 6.13 and fe L?(R), 1 < p < o, show that 
Ilf*¥, — fll, ~ 0 as e > 0. If fe L°(R), show that f+ yp, > f (weak*). 


6. If fe L'(R) and f © L'(R), show that f(x) = (27)7'}/7fg f(t)e'* dt ae. 


7. If #: L?(R) > L?(R) is the Plancherel Transform and f € L?(R), show that 
(Ff (x) = (Ff) -x). 


8. Show that ¥* = 1 but ¥* # 1. What does this say about o(F)? 
9. Find the Fourier transform of the Hermite polynomials. What do you think? 


§7. Moments 


To understand this section, the preceding two sections are unnecessary. 

Let pw be a positive Borel measure on R such that /{|¢|"du(t) =m, < 
for every n > 0. The numbers {m,,} are called the moments of p in analogy 
with the corresponding concept from mechanics. The central problem here, 
called the Hamburger moment problem, 1s to characterize those sequences of 
numbers that are moment sequences. Just as self-adjoint operators are 
connected to measures, the theory of self-adjoint operators is connected to 
the solution of this moment problem. 


7.1. Theorem. Jf {m,: n > 0} is a sequence of real numbers, the following 
statements are equivalent. 


(a) There is a positive regular Borel measure . on R such that {\t|" du(t) < oo 
for alln = 0 andm,, = {t" dp(t). 

(b) If a,...,a, € C, then L” ,_ m,, 0a, > 0. 

(c) There is a self-adjoint operator A and a vector e such that e € dom A" for 
alln and m,, = (A"e,e) for alln = 0. 


Before proving this theorem, a preliminary result is needed. This result is 
useful in many other situations and is one of the standard ways to show that 
a symmetric operator has a self-adjoint extension. 
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7.2. Proposition. Let T be a symmetric operator on # and suppose there is a 
function J: F°0—> # having the following properties: 


(a) J is conjugate linear (that is, J(h + g) =Jh + Jg and J(ah) = ash); 
(b) J* =1; 

(c) J is continuous; 

(d) JdomT C domT and TJ C JT. 


Then T has a self-adjoint extension. 


PROOF. First note that if h © dom7, then Jh © domT and h = J(Jh). 
Hence Jdom 7 = domT and JT = TJ. 

Let h © # and define L: #-—C by L(f) = ¢h, Jf). Since J is con- 
jugate linear, L is a linear functional. By (c), L is continuous. Thus there is 
a unique vector h* in # such that L(f) = (f,h*). Let J*h = h*. Thus 
J*: > #4 and 


7.3 (f, I*h) = (h, If). 


It is clear that J* is additive. If a € C, then (f/f, J*(ah)) = (ah, Jf) = 
ad f, J*h) = (f,aJ*h). Thus J* is conjugate linear. Since J* = 1, it 
follows that J*? = 1. 

Let he domT* and fe domT. Then (7/f,h) = (Jf, T*h) = 
(J*T *h, f) by (7.3). But also by (d), (TUf,h) = (JTf,h) = (J *h, Tf). 
So (J*T *h, f) = (J*h, Tf) for all h in dom7* and f in domT. But this 
says that J*h € dom7* whenever h € dom7™* and, furthermore, T *J *h 
= J*T *h. Since J ** = 1, it follows that J/*dom T* = domT* and J*T* 
aie ha a 

Now let h € ker(7* + i). Then T*J*h = J*T*h = J*(+ih) = +i *h. 
Thus J*ker(T* + i) C ker(T* #1). Since J** =1, J*ker(7* + i) = 
ker(7T * + 7). But J* is injective. Indeed, if J*h = 0, then h = J*(J*h) = 0. 
Thus the deficiency indices of T are equal. By Theorem 2.20, 7 has a 
self-adjoint extension. z 


PROOF OF THEOREM 7.1. (a) implies (b). If a@,...,a, € C, then 


n 
m4 oe, = f > a &,t/** du(t) 


7 = 


J 0 J, k=0 
= j| ye o| > aut du(t) 
j=0 k=0 
_ Z 
= {| Det) du(r) = 0, 
k=0 
(b) implies (c). Let %, = the collection of all finitely nonzero sequences 
of complex numbers {a,: n > 0}. That is, {a9,a,,...}€& #@ if a, EC 


for all n > O and a, = 0 for all but a finite number of values of n. If 


X.7. Moments 351 


x = {a,}, y= {B,} © % define [x, y] by 


7.4 [Ixy]= bh M54. 0 ;B x 
j.k=0 
It is easy to see that # is a vector space and (7.4) defines a semi-inner 
product on #%. In fact, it is routine that [-, -] is sesquilinear and condition 
(b) implies that [x, x] = 0 for all x in %. 
Let #% = {x © MH: [x, x] = 0} and let 7, be the quotient vector space 
Ho / Ho. Uh=x+ Hy and f=y+ #, = #, then 


7.5 (hf) =[x,y] 

can be verified to be a well-defined inner product on #%,. Let # be the 
Hilbert space obtained by completing #, with respect to the norm defined 
by the inner product (7.5). 

Now to define some operators. If x = {a,} © %, let Thx = 
{0, a, a, ...}. It is easy to check that 7 is a linear transformation on %. 
Also, if x = {a,}, y= {B,} ©, let Thx = {y,}. So yo = 0 and y, = 
a, ,1f n > 1. Hence 


Se 
t18 = 
Oo 


[T x, y] m5 41;Br, 


Mj.) By 


j=l 
k=0 
se — 
= = M 441% jB, 
j,k=0 
oa) 
a > M 4. By 4 
j=0 
k=1 
a [x, Toy]. 


In particular, if x © #5, then the preceding equation and the CBS in- 
equality imply that 
\[Zox, Tox ]| = | Tex, x|| < [Tex, T2x][x, x] 
= (). 

Hence 7)%, € 4. Thus 7) induces a linear transformation T on #, 
defined by T(x + %) = Tox + %. It follows that (Th, f) = ¢h, Tf) for 
all h,f in #,. Since #%, 15, by definition, dense in #, T is a densely 
defined symmetric operator on #. Now to show that T has a self-adjoint 
extension. 

Define Jp: # > # by J)({a,}) = (a, }. It is easy to see that Jp is 
conjugate linear and J = 1. Also, JyTy = TyJp. An easy calculation shows 
that [J,x, Joy] = [x, y] for all x, y in %. SoS, #4 C & and J, induces a 
conjugate linear function J,;: 4, > #, defined by Jj(x + %) = Jox + %. 
It follows that J,T = TJ,, J? = 1, and ||J,A|| = ||A|| for all 4 in 34. Thus 
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J, extends to a conjugate linear J: #7 # such that J? = 1 and ||Jh|| = ||Al| 
for all h in #. Hence J is continuous. Also, JdomT = J, 4, CH, = 
dom T and JJ C JT. By Proposition 7.2, T has a self-adjoint extension A. 

Let e, = {1,0,0, ...} € #. Hence 7Ty’e, has a 1 in the nth place and 
zeros elsewhere. If e = eg + %, then e © domT” C dom A” for all n > 0. 
Also, 


(At™e,e) = [ Tyeg, €o | =m, 


for n > 0. 

(c) implies (a). Let Y be the closed linear span of { A”e: n = 0}. For h in 
LO dom A, let Bh = Ah. It follows that B is a self-adjoint operator on ¥ 
and e is acyclic vector for B. By Theorem 4.18, there is a positive measure pt 
on R (because B is self-adjoint) such that /|t|”du(t) < oo for every n > 0 
and an isomorphism W: > L?(u) such that We = 1 and WBW' = M.. 
Thus 


fet du(t) = (Mr, 1) 


= (W'M"1,W~11) 
= (A"e,e) 


=mM,. a 


n 


EXERCISES 


1. (Stieltjes.) Let {m,,: n = 0} be a sequence of real numbers and show that the 
following statements are equivalent. (a) There is a positive regular Borel measure 
p on [0, co) such that m, = [t” du(t) for all n = 0. (b) If ay,...,a, € C, then 
vr p= 0M 440A, =O and  K=0M 441%), =O. (c) There is a self-adjoint 
eas A with o(A) C [0, 00) and a vector e in dom A” for all n > 0 such that 

= (A”e,e) for n > 0. 


2. (Bochner.) Let m: R — C be a function and show that the following statements 
are equivalent. (a) There is a finite positive measure » on R such that m(t) = 
je sar for all ¢ in R. (b) m is continuous and if ag,...,a, € C and 
ty,--.,t, © R, then L7 ,_ 9m(t; — t{,)a,;a, = 0. (c) There is a strongly continu- 
ous one-parameter unitary group U(t) and a vector e such that m(t) = 
(U(t)e,e) for all ¢. (Hint: Let 2 = all functions f: R — C that vanish off a 


finite set.) 


3. Let {m,: n © Z} € C and show that the following statements are equivalent. (a) 
There is a positive measure » on D such that m, = [z" du(z) for all n in Z. (b) 
If a_,,,.--,@_1,@,4,...,a, © C, then X” cen, _,@,@, > 0. (c) There is a 
unitary operator U and a vector e such that m, = (U"e,e) for all n. 


4. Show that the operator A that appears in the proof that (7.1b) implies (7.1c) 1s 
cyclic. 


CHAPTER XI 
Fredholm Theory 


This chapter is entirely independent of the preceding one and only tangen- 
tially dependent on Chapters VIII and IX. 

The purpose of this chapter is to study certain properties of operators on 
a Hilbert space that are invariant under compact perturbations. That is, we 
want to study properties of an operator A in @(#) that are also possessed 
by A + K for every K in #,(#). The correct view here is to consider this 
undertaking as a study of the quotient algebra B(#’)/B)(H#) = 
B/B,—the Calkin algebra. Any property associated with an element of the 
Calkin algebra is a property associated with a coset of operators and vice 
versa. It is useful—indeed essential—to relate these properties to the way in 
which the operators act on the underlying Hilbert space. 


§1. The Spectrum Revisited 


In Section VII.6 we saw several properties of the spectrum of an operator on 
a Banach space. In particular, the concepts of point spectrum, o,(A), and 
approximate point spectrum, o,,(A), were explored. It was also shown 
(VII.6.7) that do(A) C o,,(A). Recall that o,(A) is the left spectrum of A 
and o,(A) is the right spectrum of A. 


1.1. Proposition. Jf A © 4(#), the following statements are equivalent. 
(a) A €o,,(A); that is, inf{||(A — A)All: [Al] = 1} > 0. 

(b) ran{A — X) is closed and dimker(A — A) = 0. 

(c) A €o,(A). 
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(d) A € o,( A*). 
(e) ran(A* —A)=£#. 


PROOF. By Proposition VII.6.4, (a) and (b) are equivalent. Also, if B € 
B(#), then B(A — dX) = 1 if and only if (A* — A)B* = 1 so that (c) and 
(d) are easily seen to be equivalent. 

(b) implies (c). Let #4 = ran(A — A) and define T: #— 4 by Th = (A 
— A)h; then T is bijective. By the Open Mapping Theorem, T-': 4 > # 
is continuous. Define B: #- # by letting B = T~! on .@ and B =0on 
M*-. Then B € @(#) and B(A — A) = 1. (Note that we used a property 
of Hilbert spaces here; see Exercise VII.6.5.) 

(d) implies (e). Since X € o,(A*), there is an operator C in @(%#) such 
that (A* — X)C = 1. Hence #= (A* — A)CHC ran(A* — d). 

(e) implies (a). Let M=ker(A* —A)+ and define T: VW # by 
Th = (A* — X)h. Then T is bijective and hence invertible. Let C: #-> # 
be defined by Ch=T™'h. Then C#¥=W and (A* —A)C=1. Thus 
C*(A —A)=1 so that if he #H, ||h|| = ||C*(A — ADJA] < CFI ICA — 
A)Al|. Hence inf{||(A — AJA: ||Al| =1} = ||C*]|"’. 


IfAcc, AX ={d\: AEA}. 


1.2. Corollary. If A € B(#), then J0(A) C 6(A)N.0,(A) = 0,,(A) 9 
0,,)(A*)*. 


PROOF. The equality is immediate from the preceding theorem. In fact, 
a,(A) = 6,,(A) and o,(A) = o,(A*)* = 6,,(A*)*. If A © do(A), then 
(VII.6.7) A € o,,(A). But A © do(A*) so that A € o,,(A*). a 


For normal elements there is less variety. The pertinent result is proved 
here in a more general setting than that of operators. 


1.3. Proposition. Let & be a C*-algebra with identity. If a is a normal 
element of x, then the following statements are equivalent. 


(a) a is invertible. 
(b) a is left invertible. 
(c) a is right invertible. 


Proor. Assume that a is left invertible, so there is a b in & such that 
ba = 1. Thus for any x in &%, ||x|| = ||bax|| < ||b||||ax||, and hence ||ax|| > 
\|b|| ~*||x||. In particular, this is true whenever x € C*(a). Because a is 
normal, C*(a) is isomorphic to C(K) where K = o(a) and where the 
isomorphism takes a into the function z (z(w) = w). The inequality above 
thus becomes: ||z/|| > ||b|| ~'||/|| for every f in C(K ). It must be shown that 
0 € K (= o(a)). If 0 © K, then for every integer there is a function f, in 
C(K) such that O<f, <1, f(0)=1 and f,(z)=0 for z in K and 
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|z|) >n_'. Since O€ K, |If,|| = 1. But ||zf,|| < 1/n. This contradicts the 
inequality and so 0 € o(a); that is, a is invertible. 

The argument above shows that (b) implies (a). If a is right invertible, 
then a* is left invertible. By the preceding argument a* is invertible, and 
hence so is a. = 


1.4. Proposition. Jf N is a normal operator, then o(N) =0,(N) = 06,(N). 
If X is an isolated point of o(N), then \ € o,(N). 


PrRooF. The first part of the proposition is immediate from the preceding 
result. If A is an isolated point of o(N) and N = {zdE(z), then 0 # 
E({A}) = ker(N — A) (Exercise IX.2.1). | 


EXERCISES 

1. Let S be the unilateral shift of multiplicity 1 on /*(N) and find o,(S) and o,(S). 

2. The compression spectrum of A, o,( A), is defined by o.(A) = {A © C: ran(A — i) 
is not dense in #}. Show: (a) A ©o,(A) if and only if A €o,(A*). (b) 
o.(A) € o,(A), but this inclusion may be proper. (c) 9.(A) is not necessarily 
closed. (d) (A) = 9,,(A) U o.(A). 

3. If A € B(H) and f © Hol(A), then f(o,,(A)) € o,,(f(A)) and f(0,(A)) C 
0, ( f(A)). 

4. If f is a rational function with poles off o(A), show that f(o,,(A)) = 6,,(f(A)) 


and f(9,(A)) = 9,( f(A)). Give necessary and sufficient conditions on a function 
f in Hol(A) that these equalities hold. 


§2. The Essential Spectrum and Semi-Fredholm 
Operators 


Let B/&, be the Calkin algebra and let 7: # > B/B, be the natural map 
(3 is being suppressed here). Since &, is an ideal in #, #/B, is a Banach 
algebra with identity. 


2.1. Definition. If A © @(#%), then the essential spectrum of A, 0,(A), 1s 
the spectrum of 7(A) in @/&,; that is, o,(A) = o(7(A)). Similarly, the /eft 
and right essential spectrum of A are defined by o,,(A) = 0,(7(A)) and 
0,,.(A) = 0,(7(A)), respectively. 


2.2. Proposition. Let A € @( #7). 


(a) 0,.( A) U 0,.( A) = 0,( A). 
(b) o,,(A) = o,,(A*)*. 
(c) 0,.( A) = o,( A), 6,.( A) Cc 0,( A), and o,( A) Cc o( A). 
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(d) o,,(A), 6,.(4), and o,(A) are closed sets. 
(ec) If K © B(#), 6,,(A) = 9,,(A + K) and o,,(A) =9,,(A + K). 


PROOF. Parts (a), (b), (c), and (e) are trivial, and part (d) is a consequence of 
a general fact about Banach algebras. a 


In order to best appreciate and use the idea of essential spectrum, a better 
understanding of invertibility in @/BA, is needed. The following terminol- 
ogy is traditional. 


2.3. Definition. If 4 <€ @(#), Aisa left Fredholm operator if 7( A) is left 
invertible in @/%,; A is a right Fredholm operator if a(A) is right 
invertible in @/B,); A is a Fredholm operator if (A) is invertible in 
B/B,. Let F,, #,, AF denote the set of left Fredholm, right Fredholm, and 
Fredholm operators. So F= F¥,0 F,. Operators in the set PF =F, F 
are called semi-Fredholm operators. 


2.4. Proposition. The sets ¥,,.F4,, F are all open in B(H#’) andA EF, if 
and only if A* © #.. 


PROOF. Each of these sets is the inverse image under 7 of an open subset of 
B/#,. The other statement is trivial. a 


The next result, characterizing left Fredholm operators, is from Wolf 
[1959] and Fillmore, Stampfli, and Williams [1972]. 


2.5. Theorem. If A © 4(#), the following statements are equivalent. 


(a) A is a left Fredholm operator. 

(b) ran A is closed and dimker A < oo. 

(c) There is no sequence {h,,\ of unit vectors in # such that h,, > 0 weakly 
and l\im||Ah,|| = 0. 

(d) There is no orthonormal sequence {e,\ in # such that \im||Ae,|| = 0. 

(e) There isa 6 > 0 such that {h © #: ||Ah|| < d||h||} contains no infinite- 
dimensional manifold. 

(f) If the positive operator (A*A)'/? = {&tdE(t), then there is a 8 > 0 such 
that E[0, 6) is finite dimensional. 

(g) If K € &,(#), then dimker(A + K) < ©. 


PROOF. (a) implies (b). According to (a) there is a bounded operator B such 
that 7(B)7(A) = 1; that is, 7(BA — 1) = 0. Hence BA = 1+ K for some 
compact operator K. But kerA C ker BA = ker(1 + K). Since the eigen- 
spaces corresponding to nonzero eigenvalues of compact operators are finite 
dimensional, dimker A < oo. Also, the Fredholm Alternative (VII.7.9) im- 
plies ran BA = ran(K + 1) is closed. Hence there is a constant c > 0 such 
that for h L ker(BA), ||BAhA|| => c\|h||. Thus if A © [kerBA]~, cl|h|| < 
| BI || AA], or [LAA] = (c/||BI|)||Al|. This implies that A([ker BA] ~) is closed. 
But ran A = A(ker BA]*) + A(ker BA). Since A(ker BA) is finite dimen- 
sional, ran A is closed. 
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(b) implies (c). Let P be the projection of # onto (ker A)*+ . Since ran A 
is closed and A is a bijective map of (kerA)* onto ran A, there is a 
bounded operator B on # such that B = 0 on (ran A)* and BAh = h for 
h in (kerA)~. Thus BA = P. Now 1 — P has finite rank. So if {h,,} is a 
sequence of unit vectors such that h, — 0 weakly, then ||h, — Ph,|| — 0. 
But 1 = |[A,l? = | — P)A,|? + ||PA,I? = (| — P)A,|? + BAR, IP. 
Hence ||BAh,|| > 1 so that liminf]|Ah,|| > 0. 

(c) implies (d). Orthonormal sequences converge weakly to zero. 

(d) implies (e). If (e) is false, then for every positive integer n there is an 
infinite-dimensional subspace .4, such that ||Ah|| < (1/n)||h|| for all A in 
M,,. Let e, be a unit vector in .4,. Suppose e,,...,e, are orthonormal 
vectors such that e, ©. W@,,1<k <n. Let E be the projection of # onto 
V{e,,..-,e, }- If H,4,; A [e,,-.-,e,]~ = (0), then E is injective on 4, ,,. 
Since dim 4, ,, = o and dimran E < oo, this is impossible. Thus there is 
a unit vector e,,, in 4, ,, such that e,,, 1 {e,,...,e,}. The orthonor- 
mal sequence {e,,} shows that (d) does not hold. 

(e) implies (f). Let |A| = {tdE(t) and let 6 > 0. If h € E[0, 6], then 


\|Ahl|2 = (A*Ah, h) 
= (|A|*h, h) 
= [°C db, 4(0) < 8°E, ,[0,5] 


= 8°\|h\I’. 


So E[0, 6] {h: ||AA|| < 4||h||}. By (e) there is a 8 > O such that E[0, 6] 
is finite dimensional. 

(f) implies (a). Let Ws = { E[0,5]3}~ . Now |A| maps .Z, bijectively 
onto .@,. In fact, the inverse of |A|: 4; > Ws is ( [gt dE(t))|.Z5. Let 
A = U|A| be the polar decomposition of A. Since 4; C ran|A| = initial U, 
U maps .@, isometrically onto some closed subspace ¥ of ran A. Let 
V = the inverse of Uon ¥ and V=0 on #°~; that is, ViY*=0 and 
VIL= (U|M5) *. Hence V is a partial isometry. Let B, = [£°r~' dE(t) and 
put B=B.V. If he .@;, then BAh=BVU|A|h=h. If he a@sy= 
E[0,6]h, |A|h © Ws and so U\Aj|h 1 Y; thus BAh = 0. Hence BA = 
E(6, 0c) = 1 — E[0,6]. Since E[0, 5] has finite rank, 7(B)a7(A) = 1. 

(a) implies (g). If K is a compact operator, 7(A) = 7(A — K). Thus 
a(A — K) has a left inverse in the Calkin algebra. Since (a) implies (b), 
dim ker(A — K) < o. 

(g) implies (d). Suppose (d) does not hold. So there is an orthonormal 
sequence {e,} such that ||Ae,|| > 0. By passing to a subsequence if 
necessary, it may be assumed that £°_,||Ae,||* < oo. Thus for any h in #, 


Lida.e,diildentl < [Lidase, 2] | Divenll?] 7 


< C\lAll, 
where C = [S||Ae,||*]'/?. Thus Kh = L%_,¢(h,e,)Ae, defines a bounded 
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operator. Moreover, if K,h = L7_,(h,e,)Ae,, it is easy to see that ||K,, — 
K|| — 0. Thus K 1s compact. But (A — K )e, = 0 for every n, so dim ker( A 
= Ko) =": E 


Each of the parts of the preceding theorem can be used to give a 
statement equivalent to the fact that a point belongs to the left essential 
spectrum. Only one of these statements will receive such a translation. 


2.6. Corollary. 


(a) A € a,,(A) if and only if dimker(A — A) = 00 or ran(A — A) is not 
closed. 

(b) A € 0,,(A) if and only if dim[ran(A — A)]*= 0c or ran(A — A) is not 
closed. 


PrRooF. Part (a) is straightforward. Part (b) follows immediately from the 
facts that o,,(A) = 0,,(A*)* and that ran(A — A) is closed if and only if 
ran(A — A)* is closed (VI.1.10). is 


In order to prove part (b) of the preceding corollary it 1s not necessary to 
quote Theorem VI.1.10. For operators on a Hilbert space it is possible to 
give a direct proof that is easier than the Banach space case (see Exercise 2). 

The reader should compare Corollary 2.6 and Proposition 1.1. 


2.7. Proposition. If A © B(#), then o,,(A) = 9,.(A) U {A © o,( A): 
dim ker(A — A) < oo}. 


Proor. If A € o,,(A), then (1.1) either ran(A — A) is not closed or ker( A 
—))+#0. If ran(A — A) is not closed or if dimker(A — A) = 00, then 
A € o,,(A) by (2.6). The other inclusion is left to the reader. el 


2.8. Proposition. If N is a normal operator and X © o(N), then ran(N — A) 
is closed if and only if X is an isolated point of o(N). 


Proor. Assume X is an isolated point of o( NV); thus X = o(N)\ {A} is a 
closed subset of o(N). If N = {[zdE(z) and #, = E(X)#, then #, 
reduces N and o(N|#,) = X. Hence (N — A)#,_ is closed. Since #,* = 
ker(N — A), ran(N — A) = (N — A)#,; hence N — A has closed range. 

Now assume that A € o(N) but A is not an isolated point. Then there is 
a strictly decreasing sequence {7,,} of positive real numbers such that 7, > 0 
and such that each open annulus 4, = {z: 7,,, < |z — A| <7} has non- 
empty intersection with o( NV). Thus E(A,)## (0); let e, be a unit vector 
in E(A,)#. Then e, 1 ker(N — A) (= E({A})#) and 


z)<r°—>0. 


Sue 


(CN — A)e, I? =, Iz — A/?dE 
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That is, inf{||(N — A)Al|: ||Al| = 1, A L ker(N — A)} = 0 and so, by the 
Open Mapping Theorem, N — A does not have closed range. B 


2.9. Proposition. Jf N is a normal operator, then o,(N) = 6,,.(N) = 0,.(N) 
and o(N)\o0,(N)= {A € o(N): A is an isolated eigenvalue of N having 
finite multiplicity }. 


PrRooF. The first part follows by applying Proposition 1.3 to the Calkin 
algebra. If A is an isolated point of o( NV), then ran(N — A) is closed by the 
preceding proposition. So if dimker(N — A) < 00, A €0,,(N) = 0,(N) by 
Corollary 2.6. Conversely, if A € o(N)\o,(N), then ran(N — A) is closed 
and dimker(N — A) < oo. By the preceding proposition, A is an isolated 
point of o( AN). Thus A is an eigenvalue of finite multiplicity. al 


It is also worthwhile, before proceeding, explicitly to reformulate Theo- 
rem 2.5 to give a characterization of Fredholm and semi-Fredholm oper- 
ators. The proof is left to the reader. 


2.10. Proposition. 


(a) An operator A is a Fredholm operator if and only if ran A is closed and 
both ker A and ker A* = (ran A)~ are finite dimensional. 

(b) An operator A is a semi-Fredholm operator if and only if ran A is closed 
and either ker A or (ran A)~* is finite dimensional. 


2.11. Example. Let G be a bounded region in C and, to avoid pathologies, 
assume 0G = d[clG]. Let #= L7(G) (1.1.10) and define S: #-> # by 
(Sf)(z) = zf(z). Then o(S)=clG, o,(S) = 6,,(S8) = 4,,(S) = 0G = 
0,,(S), 6,(S) =O, and for A in G, ran(S — A) is closed and dim[ran(S — 
AyJt=1. 


To show that these statements are true, begin by proving: 


2.12 If \ € G,ran(S — A) = {fe L2(G): f(A) = 0} 


In fact, if h © L2(G), then [((S — A)A](z) = (z — A)h(z) so that f = (z 
— ))h vanishes at 4. Conversely, suppose f © L2(G) and f(A) = 0; then 
f(z) = (z — A)h(z) for some analytic function h on G. It must be shown 
that h © L2(G). Let r > 0 such that D = {z: |Z — A] <r} CG. Then 


ffoe= ffpme = fw 


Now ffplh|* < oo since h is bounded on D. For z in G\ D, |h(z)| = 
If(z)|/|z — A] < r7'If(z)|. Hence 


sf jules ae di < 00. 


Thus h € L2(G) and f = (S — A)h. This proves (2.12). 
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Using Corollary 1.1.12, f+» f(A) is a bounded linear functional on 
L2(G) whenever A € G. By (2.12), ran(S — A) is the kernel of this linear 
functional and hence is closed. 

Because G is bounded, the constant functions belong to L2(G). So if 
fELUG), f=[f—-f™A]+f(rA) and f— f(A) © ran(S — dA). Thus 
L2(G) = ran(S — 4) + C. Therefore dim{ran(S — A)]*+ = dim[L2(G)/ 
ran(S — A)] = 1 when AEG. 

If A&G, then S — A is not surjective; hence G C o(S). If A € clG, 
then (z — A)~! is a bounded analytic function on G. If Af=(z-—A)'f, 
then A is a bounded operator on L2(G) and it is easy to check that 
A(S — A) =(S —A)A = 1. Thus o(S) C clG. Combining these two con- 
tainments, we get o(S) = clG. 

From Proposition 2.10 we have that S —A is a Fredholm operator 
whenever A € G; thus GN o,(S) = 0. So0,(S) € 0G = o[clG]. If A & 0G, 
then A € do(S); thus A € o,,,(S) (1.2). Since ker(S — A) = (0), ran(S — A) 
is not closed. Thus 0G C o,,(S) Oo,,(S). This proves that o,(S) = 0,.(S) 
= 6,,.(S) = dG = o,,(S). 


EXERCISES 
1. Give a direct proof that (b) implies (a) in Theorem 2.5. 


2. If A € @(#) and ran A is closed, prove that ran A* is closed without using 
Theorem VI.1.10. [Hint: Show that there is a bounded operator B on # such 
that BA = the projection of # onto (ker A)* |] 


3. (Putnam [1968].) If A € @(#%), X € do(A), and A is not an isolated point of 
o(A), then ran(A — A) is not closed. Give an example of an operator A such 
that 0 is an isolated point of o(A) but ran A 1s not closed. 


4. Let G be a bounded region in C such that 0G = o[clG] and let ¢ be a function 
that is analytic in a neighborhood of clG. Define A: L2(G) > L2(G) by 
Af = $f. Find all of the parts of the spectrum of A. 


5. Let S be the unilateral shift and show that o(S)=64,(S)=clD, o,(S) = 
0,.(S) = 6,,(S) = OD, o,(S) =O, and for |A| < 1, ran(S — A) is closed with 
dim[ran(.S — A)]> = 1. 


6. Let S be the unilateral shift and put A = § @ S*. Find the parts of the 
spectrum of A. 


7. Let S be the unilateral shift and put A = S‘~). Show that o(A) = 0,(A) = 
o,,(A) = clD and o,,(A) = dD. 


8. Let A, B,C © B(H#H) and define X: #° > H™ by the matrix x=|4 wan 


C 
(a) Show that if A € F, then X¥ € F if and only if C € F. (b) If A © F, show 
that X © YF if and only if C € AF. (c) Suppose A,C € YF with dimker A 
= oo and dimkerC* = oo. Show that 0 € o,,(X)N4,.(X). 
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9. (Fillmore, Stampfli, and Williams [1972].) If A € o,.(A), then there is a projec- 
tion P, having infinite rank, such that 7(A — A)a(P) = 0. 


10. (Fillmore, Stampfli, and Williams [1972].) Let A € @(#). (a) If A has a cyclic 
vector e, show that dim{ Ae, A’e,...}4+ < 1. (b) Let A € o,,(A*). If e > 0, let 
fi, f. be orthonormal vectors such that ||(A* — A)f|| < « for 7 = 1,2 and let 
P = the projection onto V{f,, f,}. Put B=AP + (i — P)A. Show that ||B — 
A|| < 2. (c) Show that the noncyclic operators are dense in #(#%). 


§3. The Fredholm Index 


The author would like to acknowledge that James P. Williams made 
available to him a set of unpublished notes on the Fredholm index which 
formed the basis of this section. 

If A is a semi-Fredholm operator, define the (Fredholm) index of A, 
ind A, by 


3.1 ind A = dimker A — dim(ran A)~ 


= dimkerA — dimker A*. 


Note that ind A © Z U {+00} and it is necessary for either kerA or 
ker A* to be finite dimensional in order for (3.1) to make sense. For ind A to 
be well defined, it is not necessary that ran A be closed (the other part of the 
characterization of semi-Fredholm operators), but this property will be used 
in a critical way when the properties of the index are established. The main 
result of this section is the following. 


3.2. Theorem. Jf the set of semi-Fredholm operators, SF, has the relative 
norm topology from @(#) and ZU {+00} has the discrete topology, then 
ind: SA—> ZU {+o} is continuous. Moreover, if AG SF and K€ 
B (#), then ind A = ind(A + K). 


One of the uses of Theorem 3.2 is in the study of various integral and 
differential equations. More recently it has been used to study a variety of 
approximation questions in @(#) as well as several connections between 
topology and operator theory. 

Before proving Theorem 3.2, which will require a few lemmas, we will 
examine some additional properties of the index and a few examples. 

First observe that the Fredholm Alternative (VII.7.9) is an easy conse- 
quence of Theorem 3.2. Indeed, if A € C, A #0, then the operator A is 
invertible and so ind(A) = 0. If K € 4)(#), then (3.2) implies that ind(A 
— K)=0. Thus dimker(A — K) = dim[ran(A — K)]*+ and we have the 
Fredholm Alternative. 
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3.3. Proposition. 


(a) IfA © SF, then A* © SF and ind A = —ind A*. 
(b) If N is normal and N © SF, then N © F and ind N = 0. 


PROOF. (a) is clear. If N is normal and N © YF, then NE F,U-F#. By 
Proposition 1.3, N € ¥. Also, ||N*A|| = ||NA\|, so kerN = ker N*. Hence 
ind N = 0. = 


It is a good thing to keep in mind that if A © Y¥ and ind A is a finite 
number, then A €-¥ since both ker A and ker A* must be finite dimen- 
sional. 

The continuity statement in Theorem 3.2 has an easy interpretation. 
Because .YYF is an open set, its components are open. Since Z U {+00} 
nas the discrete topology, the continuity of the index is equivalent to the 
statement that the index is constant on the components of LY. This is 
quite useful in applications. 

One of the uses of the index is to examine ind(A — A) for all A for which 
this makes sense. When does it make sense? It must be that A -A © SF 
and this is true precisely when A € o,,(A) o,,(A). The next result is a 
consequence of (3.2). 


3.4. Proposition. Jf A € B(#), then ind(A — 2) is constant on the compo- 
nents of C \a,,(A)N0,,(A). If X is an isolated point of o(A) and XE 
0,,.(A) 0,,(A), then ind(A — A) = 0. 


PRooF. The map A +> A — A is a continuous map of C \ 9,,(A) N G,,(A) 
into YF. So the first part of the proposition follows from the preceding 
remarks. If A is an isolated point of o(A) and A €4,,(A) o,,(A), then 
there is a sequence {A,,} in C \ o( A) such that A,, > A. Thus ind(A4 — 4,,) 
— ind(A — A). Since ind(A — A,,) = 0 for all n, the result follows. i) 


3.5. Example. Let G be a bounded region in C such that dG = o[clG] and 
define S: L’a(G) > L2(G) by Sf=zf. Then o,,(S)4,,(S) = 0G and 
indvS — A) = —1 for A inG. If A € clG, S — A is invertible. 


In fact, in Example 2.11 it was shown that 0G = o,,(S) = o,,(S) = 0,(S), 
o,(S) =O, and dim[ran(S — d)] +=] for A inG. 


3.6. Example. Let S be the unilateral shift on /*. Then o,,(S5) 0 9,,(S) = 
oD and ind(S — A) = —1 for |Aj < 1. 

In Proposition VII.6.5 it was shown that o($) = clD, o,(S) = 0, and 
0,,>(S) = dD. Thus for |A| = 1, ran(S — A) is not closed and hence dD C 
0,.(S),,(S). Also, if |A] < 1, it was shown that ran(S — A) is closed and 
dim[ran(S — A)]*+ = 1. This implies that 0D = o,,(S) 16,,(S) and ind(S 
—X)= —-1forA mD. 
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3.7. Proposition. Jf A,B © F, then A® BEF and indA © B=indA 
+ ind B. 


PROOF. Exercise. 


Using this proposition and the preceding examples, more examples can be 
manufactured. Here is an interesting one. 


3.8. Example. Let S be the unilateral shift on /* and put A = S* @ S. 
Then o,,(A) 1 o,,(A) = OD, o( A) = clD, and for |A| < 1, ind(A — A) = 0. 


One of the most important properties of the index is contained in the next 
result. Note that Theorem 3.2 is not used in its proof so it can be used in the 
proof of (3.2). 


3.9. Theorem. /f A,B & ¥, then AB € F¥ and ind AB = ind A + ind B. 


PROOF. Since 7(F ) is the group of invertible elements of 4/4, it is clear 
that AB © ¥ whenever A and BEF. 

Clearly ker B C ker AB. Also, if h € ker AB, then Bh € kerA Q ran B. 
In fact, B maps ker AB onto ker A M ran B. Thus B induces a bijection of 
ker AB/ker B onto ker A ran B and so 


3.10 dim ker AB = dimkerB + dim[ker.A /O ran B]. 


(Note that because A, B &€ F, all of the dimensions that appear in (3.10) are 
finite integers.) 

Since ker A is finite dimensional, there is a finite-dimensional subspace 
of kerA such that “@N [kerA M ran B] = (0) and kerA =.@4+ kerAN 
ran B. Hence 


3.11 dimkerA = dim.4 + dim[ker.A M ran B]. 


It must be that @NM ran B = (0). In fact, MO ran B=.MN [kerA 
ran B] = (QO) since @ < ker A. Because dim.4 < oo, #+ ran B is closed 
(III.4.3). Let O = the projection of # onto (.@ + ran B)* =. @~* Nran Bt 
and let T = Q|ran B*; so T is surjective. If h € ker7, then h € ran Bt 
and 0 = Th = Qh; thus h € kerQ = (.@+ kerB)+*=.@+ kerB. Since 
h © ran B~, this implies that dimker T = dimZ. Since all of the spaces 
are finite dimensional, we have 


3.12 dimran B* = dim.# + dim[.@~* nran B+]. 
Now note that 
A(ran B) 0 A(.@*+* ran B+) = (0). 


In fact, if f © A(ranB)N A(@* Mran B~), then f = Ah = Ag, where 
h <ranB and g€.@~* ran B~. Thus A(h — g)=0, so h — g € kerA 
= M@+kerA O ranB. Let h- g=m-+k, where m€4@ and k € kerA 
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© ran B. Therefore (h — g,g) = (m+k,g) = (m,g) + (k,g) = 0. 


Hence 0 = (h, g) = (8, 8) = lgil’, so f = 0. 
Now we show that 


ran A = A(ranB) + A(.W@+ MranB*). 


In fact, #=(@+ ran B)®(.@+4+ ran B)+ =(.W+ ran B) @(.4+ 
Oran B~). Since 4 < ker A, we get the desired equality. 

Now an argument like that used to obtain (3.12), coupled with the fact 
that A(ran B) = ran AB, gives 


3.13 dimran AB*+ = dimran A+ +dim[.@+ Qran Bt]. 


We can now put the pieces of the puzzle together. Indeed, first using 
(3.13) and (3.10), we get 


ind AB = dimker AB — dimran AB~ 
= dimker B + dim[ker A M ran B] 


—{dimran A* +dim[.#+ Mran B*]} 
(3.11) = dimkerB + {dimker A — dim 4} 
—dimran A+ —dim[.@+ Mran B*| 
= ind A + dimkerB — {dim.#@ + dim[.4 + nran B*]} 
(3.12)=indA+indB. & 


3.14. Corollary. If A € F and R is an invertible operator, then RAR~' © F 
and ind RAR™! = ind A. 


We now begin to prove Theorem 3.2. If A © @(.#%), define 
y(A) = inf{||AA]|: [Al] = 1,4 + ker A}. 


3.15. Proposition. If A © @( 5), then y(A) = sup{y > 0: ||AAl| = yIJAll 
for allh 1 ker A} = inf{||AA||/||A||: 4 € ker A}. 


The proof of this proposition is left as an exercise. 


3.16. Proposition. Let A = @(#”). 


(a) y(A) > 0 if and only if ran A is closed. 
(b) y(A) = y(A*). 


PROOF. The proof of (a) has appeared several times in this book under 
different guises. The proof here is left to the reader. To see (b), let 
h 1 kerA. Then ||A*AA|| = |||A||A|A]| = ||AJAJAl|. But [AA © clran A* 
(Why?) = ker A*. Hence the definition of y(A) implies that ||A*AA|| > 
y(A)II|AIAll = yCA)I|AAl|; that is, |A*fl| > yCADIIfll for every f in ran A. 
Since ran A is dense in (ker A*)* , y(A*) > y(A). But A = A**, so y(A) 
> y(A*). ae 
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From here we get the following consequence. 


3.17. Corollary. Jf A € @(#), then ran A is closed if and only if ran A* is 
closed. 


3.18. Lemma. If 4,V < # and dim.4@> dim SN, then there is a vector m 
in M such that \\m|| = dist(m, ”). 


PROOF. Let P be the projection of # onto 4, so dim P(W”) < dim VW < 
dim .4. Thus P(./) is a proper subspace of 4; let mE MN P(V)~. If 
ne WN, then 0 = (Pn,m) = (n, Pm) = (n,m), som LW. Hence ||m|| 
= dist(m, WV’). = 


3.19. Lemma. [fh € #, then y(A)dist(h, ker A) < ||AAl|. 
Proor. Let P be the projection of # onto kerA*; then ||PA|| = 


2 


dist(h, ker A). Hence ||Ah|| = ||APA|| => y(A)||PA|| = y(A)dist(h, ker A). 
a 


The next result has some interest by itself as well as being a major 
stepping stone to the proof of Theorem 3.2. If the role of y(A) in the next 
and subsequent propositions impresses the reader as somewhat mysterious, 
reflect that if A is invertible, then y(A) = ||A~'||~! (Exercise 7). Now in 
Corollary VII.2.3, it was shown that if x is a Banach algebra, a, © #w, and 
bya) = 1, then a + b is left invertible whenever ||b|| < ||b,||~*. Of course, a 
similar result holds for right-invertible elements. The number y( <A) is trying 
to play the role of the reciprocal of the norm of a one-sided inverse. 

For example, if A is left invertible, then ran A is closed and ker A = (0); 
hence A € YF. The next result implies that if ||B|| < y(A), then A + B is 
left invertible. 


3.20. Proposition. [fA <€ SF and B © B(H#) such that ||B\| < y(A), then 
A+BESSF and: 


(a) dimker(A + B) < dimker A; 
(b) dimran(A + B)*+ < dimran A°*. 


PROOF. First note that because A € YF, y(A) > 0. 

If he ker(A + B) and h#0, then Ah = —Bh. By Lemma 3.19, 
y(A)dist(h, ker A) < ||BA|| < ||BI|\|A\| < yCA)||Al|.. Thus dist(h, ker A) < 
||2|| for every nonzero vector h in ker(A + B). By Lemma 3.18, (a) holds. 

Since ||Bl| = ||B*|| and y(A) = y(A*), (a) implies that dim ker( A* + 
B*) < dimker A*. But this inequality is equivalent to (b). 

It remains to prove that ran(A + B) is closed. Since A € YF, either 
dimkerA < co or dimker A* < oo. Suppose dimkerA < oo. It will be 
shown that A+ BeF¥, by using Theorem 2.5(e) and showing that if 
6 = y(A) — ||B||, then {h: ||(A + B)h|| < d||Al|} contains no infinite-dimen- 
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sional manifold. Indeed, if it did, it would contain a finite-dimensional 
subspace @ with dim.4@ > dimker A. By Lemma 3.18 there is a vector h in 
M with |h|| = dist(h, ker A). Now ||(A + B)A|| < d{/Al|, so Lemma 3.19 
implies that y(A)||h|| = y(A)dist(h, ker A) < ||AA|| < ||(A + B)A| + || BAI 
< (6 + ||BI])||A|| = yCA)||Aj], a contradiction. Thus 4 + BEY, and so 
ran(A + B) is closed. 

If dimker A = oo, then dimker A* < oo. The argument of the preceding 
paragraph gives that ran(A* + B*) is closed. By Corollary 3.17, ran(A + B) 
is closed. a 


3.21. Proposition. Jf A <= SF and ind A < 0, then there is a finite-rank 
operator F such that ker(A + F) = (0) and ind(A + F) = ind A. 


Proor. Since 0 > ind A = dimkerA — dimran At, dimkerA < o and 
dimker A < dimranA~. Let {e,,...,e,} be an orthonormal basis for 
ker A and let {f,,...,f,} be orthonormal vectors in ran A~. Define F: 
KH #H by Fh = Li _¢h,e,)f;. Thus F is a finite-rank partial isometry with 
initial F = ker A and final F < ran A.* 

If h © ker(A + F), then Ah = —Fh, hence Ah € ran A N ran A*. So 
0 = Ah = Fh; that is, h € kerA = initial F. So ||h|| = ||FA|| = 0, and, 
therefore, ker(A + F) = 0. Also, since ran F < ran A~, and initial F = 
ker A, ran(A + F) = ran A @ ran F. Thus ind(A + F) = 
—dim ran(A + F)* = —dim #60 [ran A @ ran F] = —dim ranA~ + 
dim ran F = ind A. a 


3.22. Corollary. If A is invertible and K © Bo, then ind(A + K) = 0. 


Proor. By considering A* + K* if necessary, it suffices to assume that 
ind(A + K) < 0. By the preceding proposition, there 1s a finite-rank oper- 
ator F such that ker(A + K + F) = (O) and ind(4 + K) = ind(A + K + 
F). Let L= K+ F. Since A+ L=A(1+A7!L), ker(1 + A7'L) = (0); 
thus —1¢0,(A~'L). But A~'L © @, so 1+ A 'L is invertible. By 
Theorem 3.9, ind(A + K) = ind(A + L) = ind A(1 + A7'L) = ind A + 
ind(1 + A~'L)= 0. i 


3.23. Corollary. Jf A < #, then the following statements are equivalent. 


(a) ind A = 0. 
(b) There is a compact operator K such that A + K is invertible. 
(c) There is a finite-rank operator F such that A + F is invertible. 


PROOF. (a) implies (c). By Proposition 3.21 there is a finite-rank operator F 
such that ker(A + F) = (0) and ind(A + F) = ind A = 0. Hence 
dim ran(A + F)*+= dimker(A + F)=0 and A + F is invertible. 

(c) implies (b). Clear. 

(b) implies (a). Apply Corollary 3.22 to A + K. ma 
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3.24. Proposition. Jf A = SF, kerA = (0), and B © BH) with |\|Bl| < 
y(A), then ker(A + B) = (0) and ind(A + B) = ind A. 


ProoFr. By Proposition 3.20, A+ BE SF, ker(A + B) = (0), and 
dimran(A + B)*+< dimranA*. It remains to show that dimran(A + 
B)+> dimran A+. 

Let n > 1 such that y(A) — ||B|| > ||Blln7*. ForO<k <n-—1, || - 
k/n)B\| < y(A). So Proposition 3.20 implies that A + (1 —k/n)BE SF 
and is injective. So if h € #, ||h|| = 1, then 


|4 +(1 - auue || Ah| -(1 - ala 


> y(A) — [Bl] > 0. 
Thus y(A + (1 — k/n)B) > y(A) — ||B|| > || — /n) Bi). Again, applying 
(3.20) to A + (1 — k/n)B and —(1/n)B, we have that 
A +(1 — 5 \B-<B=A +(1 -<*")a CSF 
n n n 


and 


ele t 
dim ran{ 4 +(1 — ‘ : : |B} < dim ran{ 4 +(1 — ag 


for 0 < k <n — 1. Looking at these n inequalities and noticing that the 
left-hand side for k = n — 1is dimran A* and that the right side for k = 0 
is dimran(A + B)~, we get that dimran A* < dimran(A + B)*. me 


3.25. Lemma. If A © SF and F is a finite-rank operator, then ind(A + F) 
= ind A. 


ProoF. If ind A = +00, then either ker A or ran A* is infinite dimensional. 
Because F has finite rank, the same is true of A + F. Thus ind A = ind( A 
+ F). Therefore it may be assumed that ind A is finite; that is, it may be 
assumed that A is a Fredholm operator. The proof is by cases. 

Case 1: kerF*+C kerA. Hence kerA*>C kerF=W%. So ranA = 
A(kerA*) C AW=(A+ F)WC ran(A + F). This implies that ran A+ D 
ran(A + F)~ and therefore 


3.26 dimran A+= dimran(A + F)~ +dim[ran(A + F) © ran A]. 


Also, ran A + ran F = A(kerA*) + F(kerF +) = ran(A + F) since ker A 
> kerF~. Since ran A C ran(A + F), ran(A + F) @ ran A and ran(A + 
F)/ran A are isomorphic as vector spaces. Also, the natural map of 
ran(A4 + F) onto ran(A + F)/ran A when restricted to ran F remains 
surjective. Thus ran F/ran FM ran A and ran(A + F)/ran A are isomor- 
phic as vector spaces. Combining these isomorphisms gives 


dim[ran(A + F) @ ran A] = dimran F — dim[ran FQ ran A]. 
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If we combine this with (3.26), we obtain 
3.27 dimran A+ = dimran(A + F)~ 


+ dim ran F — dim[ran FM ran A]. 


Since we want to show that ind A = ind(A + F), formula (3.27) demon- 
strates how dimran A+ and dimran(A + F')* differ. Now we must see 
how dimker A and dimker(A + F) differ. 

Note that 47 (ran F) = kerA ® [A ‘ran F M ker A+]. Hence 


3.28 dimkerA = dim A ‘(ran F) — dim[ A~'ran FO ker A*]. 


But A is injective on kerA+ and A[A'ran F 9 ker A*] = ran FO ran A. 
Thus 


3.29 dim[ A~'ran F 0 ker A+] = dim[ran F/O ran A]. 


Also, (A + F)[A tran F] = ran F. If h € ker(A + F), then Ah = — Fh, 
so that h € [A~'ran F]; that is, ker(A + F) C[A~'ran F]. Hence ker((A 
+ F)|[A~tran F]) = ker(A + F) and so 


dim[ A~'ran F] = dimker(A + F) + dimran F. 
If we combine this formula and (3.29) with formula (3.28), we obtain 
3.30 dim ker A = dimker(A + F) 
+ dimran F — dim[ran F 9 ran A]. 


Combining (3.27) and (3.30), it is clear that ind A = ind(A + F’). 

Case 2: ranF C ran A+. Hence kerf *~= ran F C kerA*. So Case 1 
implies that ind A = —ind A* = —ind(A* + F*) = ind(A + F). 

Case 3: kerF*C kerA*+ and ranF CranA. Let A, and F, be the 
operators defined from ker A~* into ran A by letting them be the restrictions 
of A and F to kerA~. We want to apply Corollary 3.22 to A, and F,. In 
fact, A,: ker A~— ran A is invertible, but there is a bit of a difficulty here 
since A, does not map a Hilbert space into itself. But this can be overcome 
since ker A* and ran A are isomorphic Hilbert spaces. (Why?) The details 
are left to the reader. By Corollary 3.22, ind( A, + F,) = 0. We now want to 
relate these dimensions to the corresponding dimensions for A and A + F. 

Since ker A C ker F, ker A C ker(A + F’). Thus ker( A, + F,) = ker(A + 
F) © ker A. Therefore, 


3.31 dim ker A = dimker(.A + F) — dimker( A, + F;). 


Also, since ker(A + F)+¢ kerAt+t, ran(A, + F,) = (A, + F,)kerA* = 
ran(A + F). Hence ran(A, + F,)* = ran A © ran(A + F). So 
dim ran(A, + F,)*= dimran A — dimran(A + F) = dimran(A + F)~ 
dim(ran A) * . (Why?) Therefore 


dim(ran A)~ = dimran(A + F)~ —dimran(A, + F,)~. 


Combining this equation with (3.31) gives that ind A = ind(A + F) — 
ind( A, + F,) = ind(A + F). 
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Case 4: The general case. Let A © ¥ and let F be a finite-rank operator. 
Let P be the projection of # onto ker A* and let Q be the projection of # 
onto ran A. So QFP is a finite-rank operator and kerQFP D ker A. Hence 
(ker OFP)+ Cc ker A* and clearly ran QFP C ran A. By Case 3, 


ind A = ind(A + QFP). 


Also, (1 — Q) FP is a finite-rank operator and ran(1 — Q)FP C ran A+ 
C [ran(A + QFP)]~*. So Case 2 implies 


ind(A + QFP) = ind(A + OFP +(1 — Q) FP) 
= ind(A + FP). 


But F(1 — P) has finite rank and [ker F(1 — P)|*C kerA C ker(A + 
FP). So Case 1 implies that 


ind(A + FP) 


ind(A + FP + F(1 — P)) 
ind((A4+F). 


PROOF OF THEOREM 3.2. The continuity of the index is the first order of 
business. Let A € SF and assume that indA < 0. It must be shown that 
there isa 6 > Osuchthatif C € YF and ||A — C|| < 6, thenind A = indC. 

By Proposition 3.21 there is a finite-rank operator F such that ker(A + 
F) = 0 and ind A = ind(A + F). Let 6 = y(A + F). By Proposition 3.24, 
if ||C — Ajj < 6, then ind(A + F) = ind(C + F). But Lemma 3.25 implies 
that indC = ind(C + F); thus ind A = indC if ||A — C|| < 6. Ifind A > 0, 
then the preceding argument shows that the index is continuous at A*. It 
follows that it is continuous at A. 

If K is a compact operator, let {F,} be a sequence of finite-rank 
operators such that ||F, — K|| — 0. By the first part of the proof, ind(A + 
F) > ind(A + K). But ind(A + F,) = ind A by Lemma 3.25. Hence ind A 
= ind(A + K). 2) 


For a more detailed treatment of the index applicable to unbounded 
operators on a Banach space, see pp. 229-244 of Kato [1966]. 


EXERCISES 
1. Prove Proposition 3.7. 
2. Verify the statements made in Example 3.8. 


3. If S is the unilateral shift, show that for every e > 0 there is a rank-one 
operator F with || F|| < e such that o(S* © S + F) = dD. 


4. Let G be an open connected subset of o(A)\o,,(A) Ua,,(A) and suppose 
Ay © G such that ind(A — A,) = 0. Show that there is a finite-rank operator F 
such that A + F — A, is invertible. Show that A + F — X is invertible for every 
A in G. 
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5. If A € @(#) and ranA is closed, show that ran A‘®? is closed. If A € SF 
and ker A = (0), show that A‘~’ © YF and ind A'~) = —o or 0. 


6. Prove Proposition 3.15. 
7. If A is invertible, show that y(A) = ||A~*||7?. 


8. Let A © ¥ and suppose f is analytic in a neighborhood of o(A) and does not 
vanish on o,(A). Show that f(A) © F and find ind f(A). 


9. Let S be the unilateral shift and let f be an analytic function in a neighborhood 
of clD such that f(z) # Oif |z| = 1. Let y(t) = f(expQzir)), 0 < t < 1. Show 
that o,(f(S)) = f(dD) = { y(t): 0 < ¢ < 1} and that if A € f( dD), ind(f(S) 
—X)= —n(y;A), where n(y; A) = the winding number of y about A. More- 
over, show that if ind( f(S) — A) = 0, then A € o( f(S)). 


10. Let S be the operator defined in Example 2.11 where G = D. Show that there is 
a compact operator K such that S + K is unitarily equivalent to the unilateral 
shift. 


11. Does the unilateral shift have a square root? 

12. Show that for every n in Z U {+00} there is an operator A in YF such that 
ind A =n. 

13. If A € YF, then for every n > 1, A” € YF and ind A” = n(ind A). 


§4. The Components of “F 


Since the index is continuous on YY and assumes every possible value 
(Exercise 3.12), AF cannot be connected. What are its components? 

Note that because YF is an open subset of a Banach space, its 
components are arcwise connected (Exercise [V.1.24). 


4.1. Theorem. If A,B ESF, then A and B belong to the same component 
of SF if and only if ind A = ind B. 


Half of this theorem is easy. For the other half we first prove a lemma. 


4.2. Lemma. If A € YF and ind A = 0, then there is a path y: [0,1] ~ SF 
such that y(0) = 1 and y(1) = A. 


PROOF. By Corollary 3.23 there is a finite-rank operator F such that A + F 
is invertible. If y(t)=A+4+tF, y(0)=A, yA)=4+ F, and y(t)E SF 
for all t. Thus we may assume that A is invertible. 

Let A = U|A| be the polar decomposition of A. Because A is invertible, 
U is a unitary operator and |A| is invertible. Using the Spectral Theorem, 
U = exp(iB) when B is hermitian. Also, since 0 € o(|A|), |A| = 
Sis. y* dE(x), where 0 < 6 <r = |All. Define y: [0,1] > #(#) by 


y(t) = cae x'dE(x) = e™*|Al'. 
[6,7] 
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It is easy to check that y is continuous, y(0) = 1, and y(1) = A. Also, each 
y(t) is invertible so y(t) © SF. S 


PROOF OF THEOREM 4.1. First assume that A, B © FY and ind A = ind B. So 
there is an operator C such that CB = 1 + K for some compact operator K. 
Thus C © & and indC = —ind B = —ind A. Hence AC € ¥ and ind AC 
= 0. By the preceding lemma there is a path y: [0,1] ~ YF such that 
y(0)=1 and y(1)=AC. Put p(t) = y(t)B — tAK. Because AK € &), 
e(t)&F for all ¢ in [0,1]. Also, p(0)= B and p(1) = ACB — AK = 
A+ K)—-AK=A. 

Now assume that ind A = — oo; so dim(ran A)* = o0 and dimkerA < 
co. Let F be a finite-rank operator such that ker(A + tF) = 0 for t ¥ 0. 
(Why does F exist?) This path shows that we may assume that ker A = (0). 
Let V be any isometry such that dim(ranV)* = oo and consider the polar 
decomposition A = U|A| of A. Since A © YF and kerA = (0), |A| is 
invertible and U is an isometry. Also, ranU = ran A, so (Exercise 4) there is 
a unitary operator W such that WUW* = JV. Let y: [0,1] — ¥ such that 
y(O) = |A| and y(1)=1 and let p: [0,1] such that p(0)=1 and 
p(1) = W. Then o(t) = p(t)Uy(t)p(t)* defines a path o: [0,1] ~- SF 
(Why?) such that o(0) = A and o(1) = V. Similarly, if ind B = — o, there 
is a path connecting B to V; so A and B belong to the same component of 
SEF 

If ind A = ind B = +00, apply the preceding paragraph to A* and B*. 

| 


4.3. Corollary. The component of the identity in ¥,FA,, is a normal sub- 
group of F and F/F, is an infinite cyclic group. 


Proor. By Theorem 3.9, ind. A— Z is a group homomorphism and it is 
surjective (Exercise 3.12). By Theorem 4.1, ker(ind) = F). ba 


EXERCISES 


1. Let G be any topological group and let G, be the component of the identity. 
Show that G, is a normal subgroup of G. 


2. What are the components of the set of invertible elements in C( dD)? 


3. If S = the unilateral shift, what are the components of the set of invertible 
elements of C*(S)? 


4. If V and U are isometries and dim(ranV)~ = dim(ranU)* , then there is a 
unitary W such that WUW* = V. 


5. Find the components of the set of partial isometries. Find the unitary equivalence 
classes of the set of partial isometries. 
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§5. A Finer Analysis of the Spectrum 


In this section we will examine the spectrum and the index more closely. For 
example, if Ad € YF, then Theorem 3.2 implies that there is a 6 > 0 such 
that if ||B — A|| < 6, then B © YF and ind B = ind A. How does dim ker B 
differ from dim ker A? In general a lot cannot be said; the next result is the 
best that can be said. 


5.1. Proposition. Jf A € SF and either ker A = (0) or ran A = #, then 
there is a 6 > O such that if ||B — A\| < 6, then dimker B = dimkerA and 
dim ran B = dimran A. 


PROOF. By Proposition 3.20 and Theorem 3.2 there is a 6 > 0 such that if 
||B — A|| < 6, then ind A = ind B, dimkerB < dimker A, and dimran B~ 
< dimran A~. Since one of these dimensions for A is 0, the proposition is 
proved. is 


If both kerA and ran A are nonzero, then there are semi-Fredholm 
operators B that are arbitrarily close to A such that dimker B < dimker A 
(see Exercise 1). In fact, just about anything that can go wrong here does go 
wrong. However, dimker(A — A) does behave rather nicely as a function 
of A. 


5.2. Theorem. If \ € 0,,(A)46,,(A), then there is a 6 >0 such that 
dim ker(A — p.) and dimran(A — p».)~* are constant for 0 < |p — Al < 4. 


PROOF. We may assume that A = 0, so A © YF. It follows that A” © SF 
for every n > 1 (Exercise 3.13). Hence ran A” =.%,, is closed. Let 4 = 

co, %,,. Note that 4,,, CW, and AW, =.%,,,,; hence AM CM. Let 
B=A\|A2. 


Claim. BZ = 7. 


If h € 4, then h € ran A and there is a unique vector f in (ker A) ~ such 
that Af=h. Nowhe.4@,,, =AM,, = A(%,, © ker A), so there is a vec- 
tor f, in 4, © kerA such that Af, = h. But the uniqueness of f implies 
that f=f, €.Z@,, for every n. Hence fe 4 and h = Af = BFE BM. 

Thus B © YF and ind B = dimkerB. By (3.2) and (3.20) there is a 
5 > 0 such that if |u| < 6, then dimker(B — p) < dimkerB, dimran(B — 
uw)+=0, and ind(B — pw) = ind B. Thus dimker(B — p) = dimkerB for 
|u| < 5. Also, choose 6 such that ind(A — p) = ind A for |p| < 6. 

On the other hand, if » # 0, then ker(A — pp) CZ. In fact, if h © ker(A 
—p), then A”h = ph, so that h = A"(p-"h) € M,, for every n. Thus for 
O < |p| < 6, dimker(A — pw) = dimker(B — p) = dimker B; that is, 
dim ker(A — p) is constant for 0 < |y| < 6. Since ind(A — p) is constant 
for these values of », dimran(A — p)~ is also constant. a 


The next result is from Putnam [1968]. 
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5.3. Theorem. Jf \ € do(A), then either X is an isolated point of o( A) or 
A € o,,(A) N4,,( A). 


PROOF. Suppose A © o(A) but A is neither an isolated point of o( A) nor a 
point of o,,(A) N a,,(A). It must be shown that A © int o( A). In fact, since 
dh €o,,(A)N0,,(A), A-AEG FF. Let &>0 such that A- pe SF, 
dim ker(A — p) = dimker(A — A), and dimran(A — »)~ = dimran(A — 
A)~ for 0 < |w — Al < 6. Since A € o( A), at least one of ker(A — A) and 
ran(A — »)~ differs from (0). Hence p € o( A) for |v — Al < 6. ls 


What happens if 4 is an isolated point of o( A)? 


5.4. Proposition. If is an isolated point of o( A), the following statements 
are equivalent. 


(a) NX € 0,,(A) No,,(A). 

(b) A is a pole of the function z > (z — A)7?. 
(c) The Riesz idempotent E(X) has finite rank. 
(d) A -—A EF and ind(A — A) = 0. 


PROOF. Exercise 3. 


Ifne ZU {+o} and A € &(#), define 
P,(A) = {X €0(A): A -A ESF and ind(A — A) =n}. 
So for n # 0, P(A) is an open subset of the plane; the set P)( A) consist of 
an open set together with some isolated points of o( A). In fact, Proposition 
5.4 can be used to show that P,( A) contains precisely the isolated points of 


o( A) for which the Riesz idempotent has finite rank. The proof of the next 
result is easy. 


5.5. Proposition. If A € @(#), then o,(A) = [o),(A) A o,,(A)] U 
P(A) U P_,(A). 


5.6. Definition. If A = 4(#), then the Weyl spectrum of A,o,(A), 1s 
defined by 


0,(A)=N{o(A+K): KE B}. 


Note that since o,(A + K) = o,(A) for every compact operator, o,,( A) is 
nonempty and o,(A) C o,(A). The way to think of the Weyl spectrum is 
that it is the largest part of the spectrum of A that remains unchanged 
under compact perturbations. It is clear that o,(4A) = 6,(A + K) for every 
K in &, but it is not so clear that o,,(A) C o( A). The following result of 
Schechter [1965] gives this and some more. 


5.7. Theorem. [f A © @(#), then o,(A) = 6,(A) UU, 4 )P,(A). 


PRooF. Clearly X = 6,(A) UU,.,)P,(A) € o,(A). Now suppose A € X. 
Then A —A © ¥ and ind(A — A) = 0. By Corollary 3.23 there is a finite- 
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rank operator F such that A + F — A is invertible. Hence A ¢ 0( A + F) 
so that A € o,,( A). a 


So for every operator A in @4(%) there is a spectral picture for A (a term 
coined in Pearcy [1978]). There are the open sets { P.,(A): 0 < |n| < oo}, the 
set P)(A) = G) UD where D consists of isolated points A for which 
dim F(A) = n, < oo, and there is the remainder of o( A), which is the set 
o,,(A)o,,(A). The next result is due to Conway [1977]. 


5.8. Proposition. Let K be a compact subset of C, let {G,: —c <n < o} 
be open subsets of K (some possibly empty), let D be a subset of the set of 
isolated points of K, and for each Xd in D letn, € {1,2, ...}. Then there is an 
operator A on # such that 0(A)=K, P(A)=G, for 0 < |n| < oo, 
P)(A) = Gp U D, and dim E(A) = n, for every X in D. 


We prove only a special case of this result; the general case 1s left to the 
reader. Let K be any compact subset of C and let G be an open subset of 
K. Put H = int[clG]; so GC H, but it may be that H # G. However, 
OH = d[cl H]. Let Tf = zf for f in L2(H), so H = P_,(T), o(T) = cl H, 
and o,,(T)O9,,.(T) = 0H = o[clG]. Let {A,} be a countable dense subset 
of K\ G and let N be the diagonalizable normal operator with o,(N ) = 
{A,} and such that dimker(N — A,) = 0 for each A,. If 0 <n < oo and 
A=N@®T™”, then o(A)=K, P_,(A)=G, and K\G=ao,(A)/ 
0,-( A). 


EXERCISES 


1. Let A © YF and suppose that ker A # (0) and ran A~* + (0). Show that for 
every 5 > O there is an operator B in YF such that ||B — A|| < 6, dimkerB < 
dim ker A, and dimran B+ < dimran At. 


2. If A € SF, show that there is a 6 > O such that dimker(A — ») = dimkerA 
and dimran(A — »)* = dimran A~ for |p| < 6 if and only if kerA C ran A” 
for every n > Ll. 


3. Prove Proposition 5.4. 
4. Prove Proposition 5.5. 


5. If A © 0P,(A) and n # 0, show that ran(A — A) is not closed. What happens if 
n= 0? 


6. (Stampfli [1974].) If A € @(#%), then there is a K in &)(#) such that 
o(A + K)=<a,(A). 


7. Prove Proposition 5.8. 


APPENDIX A 


Preliminaries 


As was Stated in the Preface, the prerequisites for understanding this book 
are a good course in measure and integration theory and, as a corequisite, 
analytic function theory. In this and the succeeding appendices an attempt 
is made to fill in some of the gaps and standardize some notation. These 
sections are not meant to be a substitute for serious study of these topics. 

In Section 1 of this appendix some results from infinite-dimensional 
linear algebra are set forth. Most of this is meant as review. Proposition 1.4, 
however, seems to be a fact that is not stressed or covered in courses but 
that is used often in functional analysis. Section 2 on topology is presented 
mainly to discuss nets. This topic is often not covered in the basic courses 
and it is especially useful in discussing various ideas and proving results in 
functional analysis. 


§1. Linear Algebra 


Let 2 be a vector space over F = R or C. A subset FE of & is linearly 
independent if for any finite subset {e,,...,e,} of E and for any finite set 
of scalars {a,,...,a@,}, if L2_,a,e, = 0, thena, = --- =a,=0.A Hamel 
basis is a maximal linearly independent subset of %. 


1.1. Proposition. Jf E is a linearly independent subset of #, then E is a 
Hamel basis if and only if every vector x in & can be written as x = Lip _ ,a,e, 
for scalars a,,...,a, and {e,,...,e,} CE. 


PROOF. Suppose EF is a basis and x © 2%, x € E. Then EU {x} is not 
linearly independent. Thus there are ay, a,,...,a, in F and e,,...,e, in E 


n n 
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such that 0 = ajx + ae, +--+: +a,e,, with a, #0. (Why?) Thus x = 
Leal Oy / My )e x: 

Conversely, if 2 is the linear span of E, then for every x in 2\ E, 
E U {x} is not linearly independent. Thus E isa basis. & 


1.2. Proposition. /f E, is a linearly independent subset of 2, then there is a 
basis E that contains Eo. 


PRooF. Use Zorn’s Lemma. 


A linear functional on & is a function f: 2 — F such that f(ax + By) = 
af(x)+ Bf(y) for x, y in Z and a, B in F. If & and ¥Y are vector spaces 
over F, a linear transformation from & into ¥Y is a function T: 2— ¥Y such 
that T(a,x, + a,x,) = a,T(x,) + a,T(x,) for x,,x, im 2% and a,,a, inF. 

If A,BC#,then A+B={a+b:aEA, be Bh}; A-B={a-b: 
acA, be B}. ForainF and AC%, aA ={aa: ac A}. lf Wisa 
linear manifold in & (that is, 4 C & and -@ 1s also a vector space with the 
same operations defined on 2%), then define 2/7 to be the collection of all 
the subsets of % of the form x + 4. A set of the form x + 74 is called a 
coset of M@. Note that (x +@4)+(yt+@)=(x+y)+@ and a(x +7) 
= ax +4 since 4 1s a linear manifold. Hence 2/Z becomes a vector 
space over F. It is called the quotient space of 2 mod %. 

Define QO: 2—> 2/M by O(x) =x +. It is easy to see that Q is a 
linear transformation. It is called the quotient map. 

If 7: &— ¥Y is a linear transformation, 


kerT = {x € %: Tx = 0}, 
fan = Tx xe 2}: 


ker 7 is the kernel of T and ranT is the range of T. If ranT = Y, T 1s 
surjective; 1f ker T = (0), T 1s injective. If T is both injective and surjective, 
then 7 is bijective. It is easy to see that the natural map 0: 4—- 2/7 1s 
surjective and kerQ = 4. 

Suppose now that 7: #— ¥ is a linear transformation and -@ 1s a linear 
manifold in 2. We want to define a map T: L/M > Y by T(x + M) = Tx. 
But T may not be well defined. To ensure that it is we must have Tx, = Tx, 
ifx,+@H=x,+ WZ. But x, +M=x,+7 if and only if x, — x, © 4, 
and Tx, = Tx, if and only if x, — x, @ kerT. So T is well defined if 
M < ker T. It is easy to check that if T is well defined, T is linear. 


1.3. Proposition. /f T: 2— Y is a linear transformation and M is a linear 


manifold in & contained in ker T, then there is a linear transformation T: 


4/M — Y such that the diagram 


es Yy 
o\, re T 
L/M 


commutes. 


A.2. Topology 377 


The preceding proposition is especially useful if 4 = ker T. In that case 
T 1s injective. 
The last proposition of this section will be quite helpful in the book. 


1.4. Proposition. Let f, f,,..., f, be linear functionals on 2. If kerf D 
(\;_,kerf,, then there are scalars a,,...,a, such that f = X}_,a,f, (that is, 


f(x) = Li_.a,f,(x) for every x in #). 


PROOF. It may be assumed without loss of generality that for 1 < k <n, 


() kerf, # () kerf. 
jel 


j#k 


(Why?). So for 1 <k <n, there is a y, in f\,,,kerf,; such that y, ¢ 
Ni _ kerf. So f(y,) = 0 for j # k, but f,(y,) # 0. Let x, = [f(x Ye. 
Hence f,(x,) = l and f(x,) = 0 for j # k. 

Now let f be as in the statement of the proposition and put a, = f(x,). 
If xed, let y=x—- Lif, (x)x,. Then f(y) = f(x) - 
Limiti(x)f(x,) = 0. By hypothesis, f(y) = 0. Thus 


0 = f(x)- x fy (x)f(x,) 


n 


= f(x) - pS af, (x); 


k=1 


equivalently, f= Li_,a,f,. 


§2. Topology 


In this book all topological spaces are assumed to be Hausdorff. 

This section will review some of the concepts and results using nets as this 
idea is frequently used in the text. 

A directed set is a partially ordered set (J, <) such that if i,,i, € J, then 
there is an i, in J such that i, >i, and i, >i,. A good example of a 
directed set is to let (X, 7) be a topological space and for a fixed x, in X 
let Y={U in FJ: x, EU}. If UVEY, define U>V if UC V (so 
bigger is smaller). Y is said to be ordered by reverse inclusion. Another 
example is found if S is any set and F is the collection of all finite subsets 
of S. Define F, > F, in F if F, D F, (bigger means bigger). Here F is said 
to be ordered by inclusion. Both of these examples are used frequently in the 
text. 

A net in X is a pair ((J, <), x), where (J, <) is a directed set and x isa 
function from J into X. Usually we will write x, instead of x(i) and will use 
the phrase “let {x,} be a net in X.” 
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Note that N, the natural numbers, is a directed set, so every sequence is a 
net. If (X, 7 ) is a topological space, x, © X, and W@= {U in 7: x, © U}, 
then let x,, © U for every U in YW. So {x,: UE MW} isa net in X. 


2.1. Definition. If {x,} is a net in a topological space X, then {x,} 
converges to x, (in symbols, x; > xX, or X9 = limx;) if for every open 
subset U of X such that x, © U, there is an ig = ig(U) such that x, © U 
for i > ig. The net clusters at x, (in symbols, x; -G> x,) if for every i) and 
for every open neighborhood U of x,, there exists an i >i, such that 
t-S.U, 


These notions generalize the corresponding concepts for sequences. Also, 
if x, > Xo, then x, ~~ Xo. Note that the net {x,: U © W} defined just 
prior to the definition converges to x). This 1s a very important example of a 
convergent Set. 


2.2. Proposition. Jf X is a topological space and A < X, then x E€ clA 
(closure of A) if and only if there is a net {a;} in A such that a, > x. 


Proor. Let Y= {U: U is open and x € U}. If x € cl A, then for each U 
in &@ there is a point a, in ANU. If UE &, then a, © U) for every 
U > U,; therefore x = lima,. Conversely, if {a,}1s a net in A and a, > x, 
then each U in Y contains a point a; and a, © AM U. Thus x EclA. 

a 


2.3. Proposition. Jf A C X, {a;} is a net in A, and a, ~y>x, then 
xEcla. 


PROOF. Exercise. 


There is a concept of a subnet of a net and with this concept it is possible 
to prove that if a net clusters at a point x, then there is a subnet that 
converges to x. The concept of a subnet is, however, somewhat technical 
and is not what you might at first think it should be. Since this concept is 
not used in this book, the interested reader is referred to Kelley [1955]. It 
might also be appropriate to mention that a topological space is Hausdorff 
if and only if each convergent net has a unique limit point. 


2.4. Proposition. If X and Y are topological spaces and f: X — Y, then f is 
continuous at x, if and only if f(x;) > f(X9) whenever x, > Xo. 


PROOF. First assume that f is continuous at x, and let {x;} be a net in X 
such that x, > x, in X. If V is open in Y and f(x,) € V, then there is an 
open set U in X such that x, © U and f(U) Cc V. Let ig be such that 
x, € U fori = iy. Hence f(x;) © V for i > ip. This says that f(x,;) > f(x9). 

Let Y= {U: U is open in X and x, € U}. Suppose f is not continuous 
at X,. Then there is an open subset V of Y such that f(x,)) © V and 
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f(U)\ V # O for every U in Y. Thus for each U in Y there is a point x, in 
U with f(x,) € V. But {x,} is a net in X with x, —> x, and clearly 
{ f(xy)} cannot converge to f(Xq). i 


2.5. Proposition. Jf f: X — Y, fis continuous at x), and {x,} is a net in X 
that clusters at X9, then { f(x,)} clusters at f(X9). 


PROOF. Exercise. 


2.6. Proposition. Let K C X. Then K is compact if and only if each net in K 
has a cluster point in K. 


PROOF. Suppose that K is compact and let {x,;; i€ 7} be a net in K. For 
each i let F; = cl{x;: j = i}, so each F, 1s a closed subset of K. It will be 
shown that { F;: i © J} has the finite-intersection property. In fact, since J 
is directed, if 7,,...,i, © J, then there is ani > i,,...,i,. Thus F, CMf_,F, 
and { F,} has the finite-intersection property. Because K is compact, there is 
an x, inf),F,.. But if U is open with x, in U and i, € J, the fact that 
Xo © cl{x,: i = ig}, implies there is an i > ig with x; in U. Thus x, => Xo. 

Now assume that each net in K has a cluster point in K. Let {K,: 
a € A} be a collection of relatively closed subsets of K having the finite- 
intersection property. If A= the collection of all finite subsets of A, order 
F by inclusion. By hypothesis, if F € F, there is a point x, in M{K,: 
a © F}. Thus {x,} 1s a net in K. By hypothesis, {x,} has a cluster point 
X,9 in K. Let a € A, so {a} © F. Thus if U is any open set containing x, 
there is an F in ¥ such that a € F and x, © U. Thus x, © UN K;,; that 
is, for each a in A and for every open set U containing x), UN K, # 0. 
Since K, is relatively closed, x) © K, for each a in A. Thus x, €1,K, 
and K must be compact. z 


The next result is used repeatedly in this book. 


2.7. Proposition. [f X is compact, {x,;} is a net in X, and x, is the only 
cluster point of {x;}, then the net {x,} converges to Xo. 


PROOF. Let U be an open neighborhood of x, and let J={jeT: 
x, € U}. If {x;} does not converge to x», then for every i in J there isa j 
in J such that j = i. In particular, J is also a directed set. Hence { x;: 
j © J} is a net in the compact set X \ U. Thus it has a cluster point y,. But 
the property of J mentioned before implies that yy is also a cluster point of 
{x,: i © I}, contradicting the assumption. Thus x; > Xo. a 


The next result is rather easy, but it will be used so often that it should be 
explicitly stated and proved. 


2.8. Proposition. Jf f: X — Y is bijective and continuous and X is compact, 
then f is a homeomorphism. 
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ProoF. If F is a closed subset of X, then F is compact. Thus f(F) is 
compact in Y and hence closed. Since f maps closed sets to closed sets, f~' 
is continuous. fa 


Note that the Hausdorff property was used in the preceding proof when 
we said that a compact subset of Y is closed. 

In the study of functional analysis it is often the case that the mathemati- 
clan is presented with a set that has two topologies. It is useful to know how 
properties of one topology relate to the other and when the two topologies 
are, in fact, one. 

If X is a set and 7,,.7, are two topologies on X, say that .7, is larger 
or stronger than 7, if 7, D7Z,; in this case you may also say that 7, 1s 
smaller or weaker. In the literature there is also an unfortunate nomencla- 
ture for these concepts; the words “finer” and “coarser” are used. 

The following result is easy to prove (it is an exercise) but it is enormously 
useful in discussing a set with two topologies. 


2.9. Lemma. If .7,,.7, are topologies on X, then 7, is larger than 7, if 
and only if the identity map i: (X, 7,) > (X, 7,) is continuous. 


2.10. Proposition. Let .7,,.7, be topologies on X and assume that 7, is 
larger than 7,. 


(a) If F is 7,-closed, F is 7,-closed. 

(b) If f: Y > (X, J) is continuous, then f: Y > (X, 7,) is continuous. 
(c) If f: (X, .7,) > Y is continuous, then f: (X, 7,) > Y is continuous. 
(d) If K is Z,-compact, then K is 7,-compact. 

(e) If X is F,-compact, then 7, = TZ). 


PROOF. (b) Note that f: Y > (X, .7,) is the composition of f: Y > (X, 7,) 
and i: (X,.7,) > (X, 7,) and use Lemma 2.9. 
(d) Use Lemma 2.9. 
(ce) Use Lemma 2.9 and Proposition 2.8. 
The remainder of the proof is an exercise. = 


APPENDIX B 
The Dual of L?( p)} 


In this section we will prove the following which appears as III.5.5 and 
III.5.6 in the text. 


Theorem. Let (X, 92, ) be a measure space, let 1 < p < 00, and let 1/p + 
1/q=1. Ifg © Lp), define F,: L?(w) > F by 


Ff) = ffedn, 


If 1 < p< o, the map g > F, defines an isometric isomorphism of L%) 
onto L?(p)*. If p=1 and (X,9,p) is o-finite, g > F, is an isometric 
isomorphism of L®() onto L'(p)*. 


Proor. If g © L4(u), then Hdélder’s Inequality implies that |F,(f)| < 
WIL plgll, for all f in L?(u). Hence F, € L?(w)* and ||F;|| < |lgll,- There- 
fore g*+> F, is a linear contraction. It must be shown that this map is 
surjective and an isometry. Assume F € L?(p)*. 

Case 1: w(X) < oo. Here x, © L?(p) for every A in Q. Define »(A) = 
F(x). It is easy to see that v is finitely additive. If {4,} CQ with 
A,24,2>.:-- andN?%_,4, =O, then 


1/p 
IXa,llp = | flxa,l? dn 


=(4,)? > 0. 


Hence »(A,,) > 0 since F is bounded. It follows by standard measure 
theory that v is a countably additive measure. Moreover, if u(A) = 0, 
x, = Oin L?(p); hence »(A) = 0. That is, » « pw. By the Radon—Nikodym 
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Theorem there is an {2-measurable function g such »(A) = {, gdp for every 
A in (2; that is, F(x,) = fx,g 4p for every A in &. It follows that 


B.1 F(f) = [fgdu 
for every simple function f. 
B.2. Claim. g © L4(u) and ||gl|, < ||F'll- 


Note that once this claim is proven, the proof of Case 1 is complete. 
Indeed, (B.2) says that F, © L?(u)* and since F and F, agree on a dense 
subset of L?() (B.1), F = F,. Also, |lglly < IIFIl = llFall = llglly- 

To prove (B.2), let ¢ > 0 and put E£, = {x © X: |g(x)| <t}. If fe L?(p) 
such that f = 0 off £,, then there is a sequence { f, } of simple functions 
such that for every n, f, = 0 off E,, |f,| < [f|, and f,(x) > f(x) ae. [pu]. 
(Why?) Thus |(f, — f)gl < 2¢/f| and fif|du = fifl-1du < Iifil,m¢ xy 
< oo. By the Lebesgue Dominated Convergence Theorem, F(f,) = {f,g du 
+ [fgdp. Also, If, —fP < 2?IfP. 80 Ilf, —fllp > 0; thus F(f,) > F(/). 
Combining these results we get that for any t > 0 and any f in L?(p) that 
vanishes off E,, (B.1) holds. 

Case la: 1 < p< 0.S01<q< o. Let f= x_~|g|7/g, where g(x) # 9, 
and put f(x) = 0 when g(x)= 0. If A = {x: g(x) # 0}, then 


ig" 
P = Eee Fees = q 
[ifr ap fi er ae J, sl du 


since pg — p = q. Therefore 
1/p 
J lel?du = ffedu = F(f) < IFIIIAll, = in| J ai? | 
E, bE, 
Thus 


1—1l1/p 


Fi] = 


f lel? dp 


l/q 
> 4d 
[te 7 
Letting ¢ > 00 gives that ||g||, < ||F'l- 
Case 1b: p= 1. So q= 0. For € > 0 let A = {x: |g(x)| > ||FI| + €}. 
For t > 0 let f= X¢ 9 48/lg|- Then |[f],; = «(A 9 E,), and so 


IFiln(A 0 £,) > ffgdu= fo gid > (WF + e)u(4 9 E)). 


Letting t > oo we get that ||Fi|u( 4) = (||FI| + ©)u(4), which can only be if 
H(A) = 0. Thus [gi], < || Fl. 

Case 2: (X,{,) is arbitrary. Let €= all of the sets EF in @ such that 
u(E) < oo. For E in Q let Q, = {A © Q: AC EF} and define (w|E)(4) = 
u(A) for A in Q,. Put L?(pjE) = L?(E,Q,, wlE) and notice that L?(p|E) 
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can be identified in a natural way with the functions in L?(X, 2, ») that 
vanish off FE. Make this identification and consider the restriction of F: 
L?(p) > F to L?(p|E); denote the restriction by F,: L?(p|E) > F. Clearly 
F, is bounded and ||F;|| < ||F\| for every E in @. 

By Case 1, for every E in & there is a g, in L4(p|E) such that for f in 
L?(ulE), 


B.3 F(f) = j fgx.du and |Igelly < WIFI 


If D, E € &, then L?(p|D OM E) is contained in both ae and L’(ylE). 
Moreover, Fp|L?(u|D 0 E) = F,|L?(p|D 0 E) = Fong. Hence gp = gy 
= Zpnz ae. [wu] on DO E. Thus, a function g can be defined on U{ E: 
E € @} by letting g = g, on E; put g =O off Uf FE: FE € &}. A difficulty 
arises here in trying to show that g is measurable. 

Case 2a: I < p < oo. Put o = sup{|[gzll,; EL & &}; so o < ||F\| < co. 
Since ||Zpll, < |lZzll, 1f D SG E, there is a sequence en } in & such that 
| See Sane for all n and Iz l|, 79. Let G=UR_LE,. If EE and 
EG =Q, then |[Zeue ll = Ig ell4 F | Sello=? lgalle ; "04: thus g, = 0. 
Therefore g = 0 off G and clearly g is measurable. Moreover, ge Lp) 
with ||g\|, = 9. 

If fe L?(p), then {x: f(x) #0} =U". D, where D,€ € and Dc 
D,,,, for all n. Thus xp f > fin L?(u) and so F(f) = lim F(x p f) = (B.3) 
lim fp, gfdu = Jgfdu. Thus F = F, and ||Fl| = ||Fell = lIgll < ¢ < IIA. 

Case 2b: p = 00 and (X, (2, ) is o-finite. This is left to the reader. be 


EXERCISE 


Look at the proof of the theorem and see if you can represent L’(X, 2, )* for an 
arbitrary measure space. 


APPENDIX C 
The Dual of C,(X) 


The purpose of this section is to show that the dual of C)( X) is the space of 
regular Borel measures on X and to put this result, and the accompanying 
definitions, in the context of complex-valued measures and functions. 

Let X be any set and let {2 be a o-algebra of subsets of X; so (X, (2) is a 
measurable space. If w is a countably additive function defined on {2 such 
that u(O) = 0 and 0 < p(A) < o& for all A in Q2, call p a positive measure 
on (X, 92); (X, 2, 1) is called a measure space. 

If (.X, (2) is a measurable space, a signed measure is a countably additive 
function p defined on 92 such that u(Q) = 0 and yp takes its values in 
R U {+00}. (Note: » can assume only one of the values +00.) It is 
assumed that the reader is familiar with the following result. 


C.1. Hahn—Jordan Decomposition. Jf is a signed measure on (X, 92), then 
LL = Py — By, Where pw, and p, are positive measures, and X = E, U E,, 
Where E,, FE, € 0, E,O £,=0, p,(£,) = 0 = p,(£,). The measures p, 
and j., are unique and the sets E, and E, are unique up to sets of b, + p 
measure zero. 

A measure (or complex-valued measure) is a complex-valued function pu 
defined on 92 that is countably additive and such that »(Q) = 0. Note that p 
does not assume any infinite values. If 4 is a measure, then (Rep)(A) = 
Re((4)) is a signed measure, as is (Imp)(A) = Im(p(A)); hence p = Rep 
+ iIlmup. Applying (C.1) to Rew and Im p we get 


C.2 b= (py — Ba) + iy — Ba) 


where pw, (1 <j < 4) are positive measures, p, 1m, (@, and my, are 
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mutually singular) and pw, L pb,. (C.2) will also be called the Hahn—Jordan 
decomposition of wp. 


C.3. Definition. If u is a measure on (X,2) and A € QQ, define the 
variation of p, |p|, by 


m 


l(a) = up| » lH (&;,)I: { E,}, is a measurable partition ofa}. 


j=l 
C.4. Proposition. Jf u is a measure on (X,2), then |p| is a positive finite 
measure on (X, 82). If is a signed measure, |p| is a positive measure. If 
(C.2) is satisfied, then |p|\(A) < Xf_ip,(A); if w is a signed measure, then 
[HM] = By + Mo. 

PROOF. Clearly |u|(4) > 0. Let {A,} be pairwise disjoint measurable sets 
and let A = U7_,4,, If e > 0, then there is a measurable partition { E;}”" | 
of A such that |y|(A) — e < L7,|y(E,)|. Hence 


x u(E; O A,,) 


n=1 


H(A) -e< y 


j=l 


But (£,0 A,,} 7, is a partition of A,, so |u|(A) — e < L°_,|u/(A,,). There- 
fore |u|(4) < L°°_,|u|(A,,). For the reverse inequality we may assume that 
|u|(4) < oo. It follows that |u|(A,) < oo for every n. (Why?) Let « > 0 and 
for each n>1 let {E{”,...,E°} be a partition of A, such that 
Ele E\)| > [pl(A,,) — /2”. Then 


d (i(A,,) < d 


at Lim zy”) 
J 


<e+ |uj(A). 


Letting N — oo and e > 0 gives that UP? |yl(A,) < |pl(A). 

Clearly |u(4)| < L7_,4,(A), so |u| < Xt_p,. It is left to the reader to 
show that |u| = , + pw, if w is a signed measure. Since py, 115, 3, fg are all 
finite, || is finite if uw is complex-valued. aw 


C.5. Definition. If «4 is a measure on (X, 2) and pv is a positive measure on 
(X, 12), say that p is absolutely continuous with respect to v (wu < p) if 
u(4) = 0 whenever »(A) = 0. If » is complex-valued, » «< v means p. < |p|. 
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C.6. Proposition. Let be a measure and v a positive measure on (X, 2). 
The following statements are equivalent. 


(a)u<-». 
(b) [ul < ». 
(c) If (C.2) holds, p,<v forl<k <4. 


PROOF. Exercise. 


The Radon—Nikodym Theorem can now be proved for complex-valued 
measures pw by using (C.6) and applying the usual theorem to the real and 
imaginary parts of p. The details are left to the reader. 


C.7. Radon—Nikodym Theorem. Jf (X, 02, v) is a o-finite measure space and 
pw is a complex-valued measure on (X, 82) such that w « v, then there is a 
unique complex-valued function f in L'( X,Q,v) such that p(A) = [,fdv for 
every A in QQ. 


The function f obtained in (C.7) is called the Radon—Nikodym derivative 
of u with respect to v and is denoted by f = du/dv. 


C.8. Theorem. Let (X,92,v) be a o-finite measure space and let w be a 
complex-valued measure on ( X, 82) such that p « v and let f = du/dv. 


(a) If g © L1(X,Q, |pI), then gf € L'(X,Q,v) and {gdp = [gfdv. 
(b) For A in 82, |p\(A) = fal fl de. 


PRooF. Part (a) follows from the corresponding result for signed measures 
by using (C.2) and a similar decomposition for /. 
To prove (b), let { £;} be a measurable partition of A. Then 


Lime) sd f ifiay =f flav. 

J ji OF A 
For the reverse inequality, let g(x) = f(x)/|f(x)| if x € A and f(x) # 0; 
let g(x) = 0 otherwise. Let {g,} be a sequence of {2-measurable simple 
functions such that g,(x) = 0 off A, |g,| < |g| < 1, and g,(x) > g(x) ae. 
[v]. Thus fg, > |flxa ae. [v]. Also, |fg,| < Iflxs and fx, © Li(v) [see 
(C.2)]. By the Lebesgue Dominated Convergence Theorem, /fg,dv > 
[lf dv. If g, = L,a;x 2, where {£;} is a partition of A and |a,| < 1, then 


f8n4v| = |[8,dul = [Xa e(E,)| < |w[(A). Thus f,if|dv < |u|(4). 


One way of phrasing (C.8b) is that |du/dv| = d|p|/dv. The next result is 
left to the reader. 


C.9. Corollary. Jf w is a complex-valued measure on (X,{2), then there is an 
Q-measurable function f on X such that |f| = 1 a.e. [|u|] and w(A) = J, fdlu| 
for each A in &. 
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C.10. Definition. Let X be a locally compact space and let Q be the 
smallest o-algebra of subsets of X that contains the open sets. Sets in Q are 
called Borel sets. A positive measure p on (X,{2) is a regular Borel measure 
if (a) w(K) < oo for every compact subset K of X; (b) for any E in QQ, 
uCE) = sup{u(K): KC E and K is compact}; (c) for any FE in QQ, 
w(£) = inf{u(U): UDE and U is open}. If pw is a complex-valued 
measure on (X, §2), pw is a regular Borel measure if |p| is. Let M(X) = all of 
the complex-valued regular Borel measures on X. Note that M(X) is a 
vector space over C. For p in M(X), let 


Cl ell = lel X). 


C.12. Proposition. (C.11) defines a norm on M(X). 


PROOF. Exercise. 


C.13. Lemma. [f » © M(X), define F.: C)(X) > C by Ff) = ffdp. 
Then F, © Co(X)* and ||F.l| = [lal 


Proor. If fe C)(X), then |Fi(f)| < ffl dlul < Illilul|. Hence Fe 
Cy( X)* and IF, < lal. 

To show equality, let fy be a Borel function such that |f,| = 1 ae. [|p] 
and u(A) = f, fod|y|. By Lusin’s Theorem, if ¢ > 0, there is a continuous 
function ¢ on X with compact support such that /|¢ — fol d|u| < e and 
ol] < suplfo(x)| = 1. Thus |ul| = ffofodlul (C8a) = ffodu = |[fodul < 
If(fo— o) dul + [fodp| < e + |F(o)| < € + |[F, || Hence ||u{| < ||F,I). 

= 


C.14. Corollary. (a) Jf U is an open subset of X and p € M(X), then 
|¥|(U) = sup{|{o dul: $ € C.(X), spt$ C U, and |\él| < 1}. (b) If » = 0, 
w(K) = inf{ fodu: ¢ € Co(X) and $= xx}. 


PROOF. (a) If U is given the relative topology from X, U is locally compact. 
Let v be the restriction of » to U. Then (a) becomes a restatement of (C.13) 
for the space U together with the fact that C.(U) is norm dense in C,(U). 

(b) If 6 > xx, then because p is positive, {ody > w(K). Thus w(K) < a 
= inf{ /odu: 6 © C)(X) and ¢ = x; }. Using the regularity of 4, for every 
integer n there is an open set U, such that K C U, and p(U,\ K) <n “l. 
Let y, © C,(X) such that 0 < y, <1, ~, = 1 0n K, and y, = 0 off U.. 
Thus ¥, > xx andsoa< fy, dus p(U,)<p(K)+n'. 


The next step in the process of representing bounded linear functionals 
on C,(X) by measures is to associate with each such functional a positive 
functional. If p © M(X), then the next lemma would associate with the 
functional Fi the positive functional J = Fy). 
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C.15. Lemma. [f F: C,(X) > C is a bounded linear functional, then there is 
a unique linear functional I: C,(X) > € such that if f& Cyo(X) and f = 0, 
then 


C.16 I(f) = sup{lF(g)|: g © Co(X) and |g| < f}. 
Moreover ||I\| = || Fil. 


PRooF. Let C)(X), be the positive functions in C)( X) and for f in C)( X), 
define J( f) as in(C.16). If a > 0, then clearly [(af) = al( f)if fE GCX),. 
Also, if g © Cy(X) and |g| <f, then |F(g)| < |JFllligll < \IFIllifl]. Hence 
I(f) S |IFIIMLMA < 0. 

Now we will show that /(f, + f,)=J(f,) + J(f,) whenever fi, f, € 
C(X),. Ife > 0, let 9), g. © Co( X) such that |g,| < f; and |F(g;)| > /(f) 
— 5e for j = 1,2. There are complex numbers B,, j = 1,2, with |6| = 1 
and F(g,) = B|F(g;)|. Thus 


I(f,\) + 1(f,) < e+ |F(g,)] + |F( 22) 
ca oa Bi F(g,) = BF(g>) 
ee IF( Big, + B>8>)). 


But (61g, + B)8)| < |gil + lgal <A +f. Hence I(f,)+ (fh) < e+ 
I(f, + f,). Since € was arbitrary, we have half of the desired equality. 


For the other half of the equality, let g © C)(X) such that |g| <f, +f, 
and I(f, + f,) < |F(g)| + & Let A, = min(|g|,f,) and A, = |g] — hy. 
Clearly h,,h, € C,(X),, 4, < fy, hy < fh, and h, + hy = |g|. Define g;: 
X > C by 

0 if g(x) = 0, 


g(x)=( h(x)g(x)  , 
“or if g(x) #0. 


It is left to the reader to verify that g, © Co(X) and g, + g, = |g|. Hence 
I(fi+f)< IF(g,) + F(g.)| + € 
< |F(g,)| + |F(g2)| + ¢ 
<I(fi)+ Mf) +e. 


Now let ¢ — 0. 

If f is a real-valued function in C)(X), then f= /, — f, where f,, fp © 
C,(X),. If also f = g, — g, for some gj), g, in Co(X),, then g, +f, =f, 
+ g,. By the preceding argument /(g,) + I(f,) = 1(f,) + 1(g2). Hence if 
we define J: ReC,(X) > R by I(f) = 1(f,) — 1(f,) where f=f, —f 
with f,;, in C)(X),, I is well defined. It is left to the reader to verify that 
T is R-linear. 

If fe C(x), then f=f, + if,, where f,, f, © ReCo(X). Let I(f) = 
I(f,) + uf, ). It is left to the reader to show that J: Cy( X) > C 1s a linear 
functional. 
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To prove that ||/|| = ||F\|, first let fe Cyo(X) and put J(f) = all(f)| 
where |a] = 1. Hence af = f, + if,, where f,, f, © ReC,(X). Thus |/(f)| 
= al(f) = I(f,) + Uf, ). Since |I(f)| 1s a positive real number, /( f,) = 0 
and /(/f,) = |J(/)|. But f; = Re(af) < |a/| = |f). Hence 


(Cf) s TA). 
From here we get, as in the beginning of this proof, that ||J|| < ||F'||. For the 
other half, if e > 0, let f © Cy( X) such that ||f]| < 1 and ||Fl| < |FCf)| + «. 
Thus ||F\| < 7(|f]) + € < |[J|j + «. ni 


C.17. Theorem. Jf J: C,(X) — C is a bounded linear functional such that 
I(f) = 0 whenever f © C,(X),, then there is a positive measure v in M(X) 
such that I( f) = [fdv for every f in Co(X) and |\I|| = v(X). 


The proof of this is an involved construction. Inspired by Corollary C.14, 
one defines »(U) for an open set U by 


v(U) = sup{I(¢): 6 & C(X),,¢@ <1, spto CU}. 
Then for any Borel set E, let 
vy(E) =inf{v(U): ECU and U is open}. 


It now must be shown that v is a positive measure and /(f) = {fdv. For 
the details see (12.36) in Hewitt and Stromberg [1975] or §56 in Halmos 
[1974]. Indeed, Theorem C.17 is often called the Riesz Representation 
Theorem. 


C.18. Riesz Representation Theorem. Jf X is a locally compact space and 
p © M(X), define F,: Cyo(X) > C by 


Ff) = {fdp. 


Then F, © Co(X)* and the map p> F, is an isometric isomorphism of 
M(X) onto C)(X)*. 


ProoF. The fact that p +> F, is an isometry is the content of Lemma C.13. 
It remains to show that p > F, is surjective. Let F © C)(X)* and define J: 
Co(X) > C as in Lemma C.15. By Theorem C.17, there is a positive 
measure v in M(X) such that J(f) = {fdv forall f inC,(X). If fe] CCX), 
then the definition of J implies that |F(f)| < 7(|f) = fif/dv. Thus, fr 
F(f) defines a bounded linear functional on C)(X) considered as a linear 
manifold in L'(v). Now C,(X) is dense in L'(v) (Why?), so F has a unique 
extension to a bounded linear functional on L'(v). By Theorem B.1 there is 
a function @ in L®(v) such that F(f) = {fodv for every f in C)(X) and 
lll, < 1. Let w(E) = f{,-¢dv for every Borel set E. Then p © M(X) and 
by Theorem C.8(a), F(f) = ffdp; that is, F = F,. 7 
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